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PREFACE 


This  document  is  the  Final  Technical  Report  on  work  performed  by  Operating  Systems, 
Inc.  under  RADC  Contract  No.  F30602-80-C-0047,  entitled  AtS  Data  Base  Generation . 

All  examples  of  messages  and  message  fragments  (sentences)  in  the  body  of  this 
document  are  based  upon  facts  and  events  reported  in  the  daily  press  —  specifi¬ 
cally,  the  Los  Angeles  Times  (July  1979)  —  Aviation  Week  (February  1973  to  July 
1 979),  American  and  Soviet  Military  Trends  since  the  Cuban  Missile  Crisis  by  John  M. 
Collins,  published  by  The  Center  for  Strategic  and  International  Studies,  Georgetown 
University,  Washington,  D.C.,  1978,  and  Jane’s  All  the  World's  Aircraft,  1976-77. 

The  messages  used  as  examples  resemble  actual  messages  only  in  their  grammatical 
structure.  Most  objects,  attributes,  and  time/location  parameters  used  are  related  to 
the  events  connected  with  the  launch,  mission,  deorbit,  breakup,  and  impact  of 
Skylab. 
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ABSTRACT 


This  final  report  presents  the  results  of  work  performed  under  RADC  contract  No. 
F30602-80-C-0047.  The  effort  described  involved  the  further  development  of  OSVs 
automated  data  base  generation  technology  and  its  implementation  in  a  computational 
environment  suitable  for  further  exploration  and  enhancement  of  the  algorithms. 

The  report  begins  with  a  brief  discussion  of  the  intelligence  problem  which  05 Ts 
message  processing  technology  intends  to  solve,  and  goes  on  to  summarize  *.ha 
development  of  the  technology  thus  far.  Section  2  offers  a  summary  of  OSI’s  metho¬ 
dological  approach  to  the  analysis  and  description  of  event  reports.  This  methodol¬ 
ogy,  initially  developed  on  the  basis  of  messages  dealing  with  air  activities,  has  been 
extended  to  cover  reports  of  events  involving  missile  and  satellite  launchings  and 
related  events.  Section  2.1  offers  an  overview  of  related  efforts  in  the  area  of 
natural  language  understanding  and  complex  information  processing.  Section  2.2 
describes  OSi’s  orientation  to  the  problem.  Section  2.3  describes  our  particular  for¬ 
malism  for  knowledge  representation,  which  forms  the  core  of  our  methodology.  Sec¬ 
tion  3  covers  the  message  processing  and  event  record  generation  algorithms  used 
in  the  current  system,  MATRES  III.  It  gives  an  overview  of  the  processing  from  raw 
text  input  to  final  output  of  the  event  record.  Section  4  presents  a  model  data  base 
system  of  event  records  and  describes  an  experimental  program  for  retrieving  these 
event  records.  Section  5  looks  to  the  future  of  the  message  processing  technology, 
and  considers  further  development  in  two  directions:  building  a  production  system 
embodying  the  algorithms  developed  thus  far,  and  further  enhancement  of  the  pro¬ 
cessing  power  of  the  algorithms.  Examples  of  the  kind  of  message  segments  that 
the  system  can  handle  are  presented  in  Appendix  A,  and  some  sample  event  records 
synthesized  from  these  segments  are  given  in  Appendix  B.  The  detailed  design, 
structures,  and  algorithms  of  the  MATRES  III  system  are  described  in  a  companion 
report  [Dwiggir.*  and  Silva  1981]. 
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1.0  INTRODUCTION  AND  SUMMARY 
1.1  Introduction 

This  final  report  presents  the  results  of  work  performed  under  RADC  contract  No. 
F30602-80-C-0047.  The  work  performed  under  this  contract  involved  the  further 
development  of  0SI*s  automated  data  base  generation  technology  and  its  implemen¬ 
tation  in  a  computational  environment  suitable  for  further  exploration  and  enhance¬ 
ment  of  the  algorithms.  The  following  sections  briefly  discuss  the  intelligence  prob- 
iom  which  OSl’s  message  processing  technology  intends  to  solve  (1.1.1.),  and  sum¬ 
marize  the  development  of  the  technology  thus  far  (1.1.2.) 

7.7.7  Problem  Statement .  The  task  of  an  intelligence  analyst  is  to  predict  the  future 
on  the  basis  of  information  describing  what  has  happened  in  the  past  and  what 
events  are  currently  taking  place. 

At  the  global  level,  the  questions  the  analyst  asks  himself  are:  "What  is  happening?*' 
'•What  does  it  mean  in  terms  of  my  knowledge  about  similar  events  in  the  past?**, 
‘•What  is  going  to  happen  next"?  He  is  concerned  with  certain  states  of  affairs,  and 
events  signifying  changes  in  these  states  of  affairs . 

When  working  with  a  single  message,  the  analyst  seeks  answers  to  at  last  the  fol¬ 
lowing  questions: 

1 .  What  is  its  information  content? 

2.  How  reliable  is  the  source? 

3.  How  "credible"  is  the  data? 

His  evaluations  of  incoming  information  are  based  on  his  cognitive  models  of  certain 
kinds  of  situations,  the  personalities,  entities,  and  processes  involved,  and  the 
potentialities  and  constraints  associated  with  changes  in  an  existing  state  of  affairs. 

Given  the  volume  of  information  he  must  sift,  and  the  complexity  of  the  cognitive 
models  Involved,  the  difficulties  of  the  analyst's  task  are  obvious.  Aids  to  support  his 
analytical  processes  clearly  must  involve  means  for  distilling  the  content  of  incoming 
information  into  a  form  which  is  compact,  usable,  and  compatible  with  his  view  of  the 
world. 

Information  on  the  world  situation  comes  to  the  l&W  analyst  mainly  in  the  form  of 
intelligence  messages,  which  are  electrically  received  in  an  l&W  center  24  hours  a 
day.  The  messages  come  from  many  different  originators,  and  are  largely  in  the  form 
of  narrative  text.  The  volume  of  message  traffic  is  extremely  high,  and  in  a  crisis 
situation,  increases  dramatically.  Even  under  normal  operating  conditions  it  is  very 
difficult  for  an  analyst  to  isolate  items  of  information  from  message  text  and  to 
assimilate  and  correlate  these  items  into  a  pattern  of  events  of  indications  signifi¬ 
cance.  in  a  crisis  situation,  the  analyst  is  completely  saturated  with  data,  and  the 
performance  of  his  task  demands  superhuman  capabilities  for  handling  the  enormous 
amount  of  information  which  is  contained  in  the  message  traffic. 

A  computer,  on  the  other  hand,  can  process  large  amounts  of  information.  Thus,  the 
notion  of  offloading  some  of  the  processing  functions  onto  the  machine  seems  to  pro¬ 
vide  a  logical  solution  to  the  information  problem. 
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In  order  for  the  computer  to  process  the  contents  of  a  message  as  information,  how¬ 
ever,  that  information  must  be  extracted  from  the  text  of  the  message.  Currently, 
that  information  must  be  extracted  manually  for  input  to  an  analysis  system,  which  is 
a  tedious  and  error-prone  task,  particularly  when  performed  by  an  analyst  with  other 
important  demands  on  his  time. 

7.7.2  Toward  a  So/ut/on.  For  the  past  several  years  RADC  has  been  sponsoring  an 
exploratory  and  developmental  program  related  to  the  design  and  development  of  a 
general  methodology  for  the  efficient  and  effective  exploitation  of  the  content  of 
electrically  transmitted  intelligence  messages.  The  long  term  goal  of  this  work  is  to 
develop  a  system  which  wouid  assist  the  analyst  in  creating  and  maintaining  format 
ted  data  bases  derived  from  natural  language  text,  and  thus  offload  some  of  the  re¬ 
cessing  functions  from  the  analyst  to  the  computer.  Such  a  system  should  prcviae 
the  analyst  with  information  which  is  needed  for  the  attainment  of  his  particular  goal, 
La.,  Information  which  is  relevant  to  his  task,  is  of  high  apistemic  standing,  and  there¬ 
fore  useful  to  solving  his  problems. 

As  mentioned  above,  the  work  described  in  this  report  is  concerned  with  the  analysis 
of  textual  reports  of  events  and  the  synthesis  of  relevant  information  elements  in  a 
format  suitable  for  automated  input  to  a  data  base  and/or  analysis  system. 

Specifically,  the  program  addresses  the  problem  of  automating  the  analysis  of  the 
narrative  text  portions  of  intelligence  messages  describing  events,  with  the  aim  of 
transforming  them  into  succinct,  formatted,  computer  processable  content  represen¬ 
tations. 

The  automated  generation  of  information  elements  from  narrative  message  text 
requires  that  the  computer  in  some  sense  "understand"  natural  language  text. 
Within  the  context  of  the  work  described  here,  we  say  that  a  computer  system 
understands  an  input  text  insofar  as  it  can  construct  an  adequate  representation  of 
the  Information  content  of  that  text.  Specifically,  we  require  that  the  output  of  the 
computer  understanding  process,  when  applied  to  some  message  text,  furnish  the 
analyst  with  at  least  those  information  elements  that  he  would  himself  have 
extracted  from  that  particular  text. 

The  aim  is  to  model  the  cognitive  activities  of  the  human  analyst  as  he  reads  and 
understands  message  text,  distilling  its  contents  into  information  items  of  interest  to 
him,  and  building  a  conceptual  model  of  the  information  conveyed  by  the  message 

In  order  to  model  this  human  cognitive  activity,  the  computer  must  be  equipped  with 
representations  of  both  linguistic  and  extra-linguistic  knowledge,  and  a  means  of 
manipulating  such  representations  for  the  analysis  of  text  and  synthesis  of  informa¬ 
tion  elements.  The  elements  must  then  be  presented  in  a  clear  and  useful  format 
suitable  for  the  task  at  hand. 

1.2  Summery 

The  MATRES  II  system  which  resulted  from  OSfs  earlier  efforts  in  this  area  was 
developed  in  a  hardware/software  environment  offering  only  limited  capabilities  for 
the  kind  of  development  programming  needed  in  such  a  task.  In  the  course  of 
developing  that  system,  we  became  familiar  with  the  concept  of  logic  programming 
[Kowalski  1979]  and  the  Prolog  language  as  an  embodiment  of  that  concept  [Warren 
et  al  1977,  Pereira  et  al  1978].  We  used  the  Prolog  formalism  as  the  oasis  for 
implementing  our  event  record  generation  algorithms. 
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Since  then,  a  Prolog  system  became  available  to  us,  and  we  learned  of  a  form  of 
grammar  (called  "definite  clause  grammar")  which  is  both  natural  to  Prolog  and  quite 
powerful  enough  to  express  sophisticated  English  language  grammars  [Pereira  and 
Warren  1980],  Using  these  tools,  we  have  developed  and  extended  the  earlier 
algorithms  into  a  system  that  works  in  a  unified  environment,  which  is  capable  of  pro¬ 
cessing  amounts  of  data  large  enough  to  do  serious  development  of  the  algorithms, 
and  which  is  capable  of  supporting  significant  development  without  the  limitations 
and  difficulties  associated  with  the  earlier  development  environment.  We  also  took 
advantage  of  Prolog’s  dual  status  as  programming  language  and  data  base  system  to 
a  simple  model  of  the  kind  of  storage  and  retrieval  capability  that  can  make  use 
o.  the  event  records  generated  by  our  technology. 

There  are  four  major  sections  to  this  report.  Section  2  offers  a  summary  of  OSl’s 
methodological  approach  to  the  analysis  and  description  of  event  reports.  This 
methodology,  initially  developed  on  the  basis  of  messages  dealing  with  air  activities, 
has  been  extended  to  cover  reports  of  events  involving  missile  and  satellite  launch¬ 
ings  and  related  events.  Section  2.1  offers  an  overview  of  related  efforts  in  the 
area  of  natural  language  understanding  and  complex  information  processing.  Section 
2.2  describes  OS Ps  orientation  to  the  problem.  Section  2.3  describes  our  particular 
formalism  for  knowledge  representation,  which  forms  the  core  of  our  methodology. 

Section  3  covers  the  message  processing  and  event  record  generation  algorithms 
used  in  the  current  system,  MATRES  III.  It  gives  an  overview  of  the  processing  from 
raw  text  input  to  final  output  of  the  event  record. 

Section  4  presents  the  model  data  base  system  that  demonstrates  a  kind  of  retrieval 
that  could  be  appropriate  to  an  event  analysis  system  as  described  above. 

Section  5  looks  to  the  future  of  the  message  processing  technology,  and  considers 
further  development  in  two  directions:  building  a  production  system  embodying  the 
algorithms  developed  thus  far,  and  further  enhancement  of  the  processing  power  of 
the  algorithms. 

Examples  of  the  kind  of  sentences  that  the  system  can  handle  are  presented  in 
Appendix  A,  and  some  sample  event  records  from  some  of  these  sentences  are  given 
in  Appendix  B. 

The  detailed  design,  structures,  and  algorithms  of  the  MATRES  III  system  are 
Scribed  in  a  companion  report  [Dwigglns  and  Silva  1981], 
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2.0  MESSAGE  TEXT  ANALYSIS  CONCEPTS 
2. 1  Introduction 

Thu  section  describes  OSI’s  automated  data  base  generation  technology.  This  tech¬ 
nology  has  grown  out  of  several  RADC-sponsored  exploratory  research  efforts 
directed  toward  the  development  of  a  general  methodology  for  both  interactive  and 
fully  automated  exploitation  of  the  content  of  electrically  transmitted  intelligence 
messages  in  order  to  create  and  maintain  formatted  indications  and  warnings  (;£W) 
bases.  The  motivation  for  this  work  arises  from  the  acute  need  to  assist  the 
!&W  analyst  in  dealing  with  message  traffic,  particularly  in  an  information  overioao 
situation. 

Daily  message  traffic  consists  of  messages  in  which  the  information  can  be  format¬ 
ted,  (in  the  form  of  tables),  semi-formatted  (a  mixture  of  tables  and  narrative  text), 
or  unformatted,  in  which  case  ail  the  information  reported  <s  in  the  form  of  narrative 
text.  A  design  concept  for  the  automated  analysis  of  formatted  messages  and  tech¬ 
niques  for  their  conversion  into  structured  data  base  elements  tor  use  in  l&'vV  hOP 
systems  was  developed  by  Montgomery  ana  :s  described  in  Silva  ana  Montgomery 
[197?’],  voi.  II.  While  the  difficulties  involved  tne  synthesis  of  data  base  elements 
Trom  formatted  messages  should  not  be  underestimated,  the  procedures  required  are 
wait  within  the  state-of-the-art.  The  interpretation  of  unformatted  messages,  on  the 
other  hand,  requires  the  utilization  of  very  advanced  and  sophisticated  analytical 
techniques  aimed  at  a  ’’computer  understanding4’  of  natural  language  text,  for  which 
there  are  no  off-the-shelf,  generally  applicable  working  Algorithms. 

Most  active  researchers  in  the  area  of  natural  language  understanding  by  computer 
come  from  the  fields  of  Computational  Linguistics  and  Artificial  Intelligence  (Al).  in 
the  last  decade  an  impressive  array  of  experimental  language  understanding  sys¬ 
tems  have  been  built  and  applied  to  various  tasks.  These  systems  take  fragments  of 
English  text  as  input,  and  provide  a  representation  of  the  meaning  content  of  that 
text  as  output. 

There  exist  several  comprehensive  reviews  of  natural  language  understanding  pro¬ 
grams  and  systems.  For  example,  Wilks  [1974]  surveys  and  compares  the  major 
projects  or  the  understanding  of  natural  language  as  they  existed  at  that  time. 
Inr*  dad  in  the  discussion  are  Winograd's  SHRDLU,  Charmak’s  work  on  the  resolution 
ui  pronoun  ambiguities  in  children’s  stories,  Colby’s  PARRY,  Simmons’  work  on  seman¬ 
tic  networks,  Schank’s  MARGIE,  and  Wiiks;  own  work,  as  reflected  in  an  Engiish-to- 
French  translation  system.  Another,  more  recent  survey,  is  that  by  Kender  [1977], 
who  provides  us  with  an  excellent  Annotated  Bibliography  of  Natural  Language  and 
Speech  Understanding  Systems ,  lender’s  bibliography  summarizes  some  80  papers  in 
the  field,  and  also  includes  several  overviews  and  criticisms,  usually  in  the  form  of 
comparative  studies. 

Another  area  which  contributes  to  and  benefits  from  research  in  natural  language 
understanding  is  that  of  "expert"  systems.  These  are  computer  programs  designed 
to  solve  specific  technical  problems,  perform  medical  diagnoses,  provide  education, 
training  or  guidance,  and  act  as  "assistants"  to  humans  engaged  in  some  task.  Many 
of  them  have  natural  language  understanding  components  which  allow  users  to  con¬ 
verse  with  the  system  in  a  natural  way  during  the  performance  of  some  task  (c.f. 
[Woods  1977],  [Bobrow,  D.  G.  et  4i.  1977];  and  [Brown  et  al.  1975]). 
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A  list  ot  "  'stem-  md  program s  wnich  allow  natural  fanguage  commur-  .  *u^  /dr 
computer  s'ieciTicallv  fur  the  purpose  of  data  base  quc  /  ^cer. tiv  ccnr.p..r_d  sy 
Wait-!  ^977],  and  published  in  a  special  issue  of  the  SIGAPf  A/eivs/erfe.  ,  devoted  to 
Natural  uangvage  Interfaces. 

Research  efforts  specifically  concerned  with  the  automated  creation  of  structure  - 
databases  from  narrative  text,  however,  seem  to  be  few  and  restricteo  ro  applica¬ 
tions  in  the  field  of  medicine.  The  best-known  effort  in  th.s  area  is  the  ongoing 
research  ct  the  Linguistic  String  Project  at  the  New  York  University  unaer  tne 
leadership  of  Naomi  Sager  The  Linguistic  String  Project  js  concerned  with 
development  of  techniques  for  converting  medical  records  written  in  £nq(irl 
tabular  structures  called  "information  formats"  from  which  it  will  be  possioi«- 
gather  statistics  and  answer  questions  ([Hirschman  et  al.  1976],  [Grishman  et  aT 
"378],  and  [Sager  1978]).  In  this  method,  input  sentences  are  subjected  tc  an  ela¬ 
borate  syntactic  analysis,  followed  by  the  application  of  "formatting  transformations'’ 
which  fin  in  the  rows  or  the  tabuiar  '‘information  formats"  mentioned  above.  The 
latter  constitute  ;he  building  blocks  of  the  data  base. 

Another  interesting  approach  to  automated  database  generation  is  the  one  taken  by 
the  Heurtsr.c  Frogrammmg  Project  at  Stanford  University  Bonnet  [1979]  describes  a 
computer  program  designed  to  understand  medical  summaries  describing  the  status 
of  patients,  in  his  system,  "computer  understanding"  is  guided  bv  abstract  data 
structures  called  ’schemas",  which  model  the  way  physicians  present  medical  prob¬ 
lems  in  their  summaries. 

The  next  section  defines  QSVs  orientation  to  the  problem  of  automated  data  base 
generation  from  narrative  text,  and  examines  the  concept  of  a  "computer  under¬ 
standing",  which  underlies  OSTs  technology.  Section  2.3  examines  OS!  s  approach  to 
the  representation  of  knowledge  in  the  light  of  the  intended  application,  and 
discusses  two  data  structures,  the  template  and  the  event  record,  which  play  an 
important  role  in  the  representation  of  knowledge  about  events  and  the  transforma¬ 
tion  of  narrative  message  text  into  conceptually  structured  data  elements.  OSTs 
methodological  approach  is  implemented  in  a  computer  system,  called  MATRES,  which 
serves  as  a  testbed  for  the  continuing  development  of  the  procedures  required  fc; 
automated  generation  of  structured  data  elements  from  the  narrative  text  of  intelli¬ 
gence  messages.  A  detailed  functional  description  of  MATRES  is  offered  n  Section 
T.  Briefly,  MATRES  takes  sentences  of  messages  as  input,  analyzes  and  interprets 
each  such  sentence,  identifies  information  elements  relevant,  to  a  given  task  v-d 
organizes  those  elements  into  compact,  computer-processable  content  representa¬ 
tions  suitable  for  input  to  and  update  of  l&W  data  bases.  The  current  vers  n  jt 
MATPES  >s  implemented  in  the  programming  language  Prolog,  and  runs  3n  a  PDP- 
J  1/70  under  the  Unix  operating  system. 

2,2  OSi’s  Orientation 

Operating  Systems'  approach  to  automated  database  generation  is  aimed  toward  rl,e 
3'jpport  nformatlon  processing  functions  in  the  intelligence  community  WhTe 
f;rmly  committed  to  the  utilization  of  fhe  most  advanced  techniques,  OSi  <s  highly 
veiective  in  its  approach  to  the  solution  of  given  problems.  OSTs  policy  has  always 
to  search  for  the  mo3t  sensible  and  productive  ways  to  aid  and  support  toe 
fit  n : v  j fJ s  mission  '■Qlatrve  to  his  particular  environment  and  the  ty  pe  of  lata  with 
wt.  ch  he  works.  Applicable  information  processing  techniques,  therefore,  are 


evaluated  in  the  light  of  the  needs  and  requirements  of  the  particular  intelligence 
activities  they  are  to  serve. 

The  OSI  approach  integrates  considerable  practical  experience  gair.od  from  many 
years  of  involvement  with  the  design  and  development  of  sophisticated  information- 
processing  tools  for  the  intelligence  community  with  theoretical  and  practical 
advances  in  several  disciplines,  including  linguistics,  computational  linguistics,  artifi¬ 
cial  intelligence,  and  advanced  database  technology. 

One  of  the  key  concepts  underlying  OSI's  automated  data  base  generation  technol¬ 
ogy  is  that  of  ‘'computer  understanding"  of  message  text.  For  the  purposes  of  "his 
work,  we  say  that  a  computer  system  "understands"  a  message  text  insofar  as  it 
can  construct  a  representation  of  the  meaning  content  of  that  text  which  satisfies 
the  information  requirements  of  the  prospective  class  of  users.  Specifically,  we 
require  that  the  output  of  the  understanding  process  furnish  the  analyst  with  at  least 
tnose  information  items  that  he  would  himself  have  extracted  from  that  text  during 
t ha  performance  of  a  particular  task. 

To  achieve  a  "computer  understanding"  of  message  text,  an  attempt  must  be  made 
to  model  the  human  processes  of  assimilation,  i.e.,  the  computer  must  in  some  sense 
be  made  to  emulate  the  mental  processes  of  an  analyst  scanning  a  text  and  distilling 
lh*om  it  those  items  of  information  which  are  compatible  with  his  current  requirements, 
and  therefore  relevant  and  useful  to  the  attainment  of  his  goal. 

Understanding  language  is  a  very  complex  intellectual  process,  and  the  knowledge  of 
the  subprocesses  involved  is  at  best  fragmentary.  Nevertheless,  recent  theoretical 
and  methodological  advances  in  Cognitive  Science  —  a  new  science  which  combines 
theoretical  and  empirical  methodologies  from  computational  linguistics,  artificial  intelli¬ 
gence,  philosophy,  cognitive  psychology,  computer  science  and  education  —  have  led 
to  a  number  of  significant  insights  which  together  shed  enough  light  upon  the  nature 
of  these  processes  that  meaningful  investigations  can  be  undertaken.  The  next  few 
paragraphs  give  a  brief  sketch  of  the  theory  of  comprehension  adopted  for  the  pur¬ 
poses  of  this  work. 

This  theory  is  based  upon  the  assumption  that,  when  people  interpret  text,  they  do 
so,  not  on  the  basis  of  the  text  alone,  but  by  assimilating  new  information  into  a 
framework  of  already  existing  knowledge  which  has  been  derived  from  personal 
experience  with  the  real  world,  processed,  structured,  and  stored  in  memory,  ready 
to  be  accessed  when  required,  ({Norman  and  Rumelhart  1975],  [Miller  and  Johnson- 
Laird  1970],  [Miller  1978]).  In  other  words,  this  theory  claims  that  the  information 
language  conveys,  i.e.,  its  semantic  structure,  is  interpreted  directly  in  terms  of  the 
human  mind's  organization  of  knowledge  derived  from  experience,  a  level  of  mental 
representation  which  is  often  referred  to  as  "conceptual  structure".  This  is  assumed 
to  be  a  level  of  representation  at  which  knowledge  acquired  from  all  sources,  includ¬ 
ing  language,  vision,  nonverbal  auditory  perception,  touch,  etc.,  are  fused  and 
represented  in  a  uniform  manner. 

As  Rumelhart  and  Ortony  [1976]  remind  us,  this  view  of  language  comprehension  is 
not  entirely  new.  Similar  ideas  are  discussed  in  Bartlett  [1932],  and  much  earlier,  by 
Kant  in  his  Critique  of  Pure  Reason  (1787).  Recent  discussion  of  related  issues  can 
be  found  in  [Goffman  1974],  [Fillmore  1975],  [Bobrow  and  Norman  1975],  [Bransfcrd 
and  McCarrell  1974],  and  [Schank  1975]. 
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In  the  intelligence  environment,  the  analyst  see^  answers  to  such  questions  as 
‘‘What's  happening**?  “Who  is  doing  what  to  whom"?  “Is  the  current  situation 
changinn"0  'Who  are  the  key  figures  bringing  about  the  change"?  "What  are  the 
consequences  of  such  changes"?  "What  do  we  do  about  it"?  "What  are  our 
courses  of  action?" 

An  analyst  versed  in  a  given  subject  matter  assimilates  and  evaluates  the  informa¬ 
tion  communicated  by  the  message,  draws  conclusions,  and  produces  intelligence 
using  many  kinds  of  knowledge,  including  his  innate  knowledge  of  English  grammar,  his 
expert  knowledge  of  the  objects,  events,  and  relations  characteristic  of  his  subject 
domain,  and  his  knowledge  of  the  laws  and  probabilities  governing  the  entities  m  ;is 
domain. 

Modeling  the  processes  of  assimilation  in  a  computer  is  not  a  trivial  task.  It  is  obvi¬ 
ous  that  in  order  to  achieve  such  a  goal,  the  machine  must  must  incorporate  many 
skills.  It  must  be  provided  with  full  "language  understanding"  capabilities  modeied 
upon  those  of  the  human  reader,  i.e.,  the  system  must  be  able  to  perform  a  linguistic 
analysis  of  text,  and  subsequently  interpret  the  results  within  the  conceptual  frame¬ 
work  of  a  particular  subject  domain. 

The  next  section  examines  issues  of  knowledge  representation. 

2.3  Knowledge  Representation 

One  of  the  most  important  design  questions  facing  the  implementer  of  knowledge- 
based  systems  concerns  how  to  structure  the  knowledge  base  of  tacts  and  rules  so 
that  appropriate  items  can  be  efficiently  stored,  accessed  and  modified.  Current 
approaches  to  Knowledge  representation,  include  formal  logtc  notations,  production 
rules,  various  network  structures,  "structured  object  representations"  and  various 
combinations  of  these  methods.  "Structured  object  representations",  which  in  the  A1 
literature  are  variously  referred  to  as  "semantic  networks",  "frames",  "scripts"  and 
"schemata",  tend  to  package  information  in  "chunks".  This  is  done  by  aggregating 
several  related  facts  into  larger  structures  that  are  identified  with  important  objects 
(entities)  in  the  subject  domain  of  the  system. 

8rachman  and  Smith  [1980]  provide  a  comprehensive  survey  of  current  knowledge 
representation  research  in  the  February,  1980  issue  of  the  SJGART  Newsletter. 
Other,  more  detailed,  treatments  of  current  approaches  can  be  found  in  [Bobrow  and 
Collins  1975],  [Rumeihart  and  Ortony  1976];  [Schank  1975];  [Walker  1978];  and 
[Bobrow  and  Winograd  1977],  For  a  description  of  OSI’s  work  in  this  area,  see 
[Kuhns  1974];  [Kuhns  et  ai.  1975];  [Silva  and  [Montgomery  1978],  and  [Silva  et  ai. 
1979a,  1979b]. 

We  now  turn  to  a  discussion  of  OSI's  approach  to  the  representation  of  knowledge 
and  discuss  its  appropriateness  in  the  light  of  the  intended  application. 

OSTs  message  analysis  system  is  designed  to  process  intelligence  reports  describing 
real-world  actions,  events  and  situations,  their  associated  entities,  and  the  relations 
between  them.  Facts  of  this  kind  may  be  reported  with  various  degrees  of  uncer¬ 
tainty,  and  are  often  incomplete.  Several  messages  may  seemingly  report  on  the 
same  facts,  but  there  may  be  conflicts.  If  a  data  base,  which  no  longer  consists  of 
the  original  message  traffic  but  of  structured  content  representations  derived  from 
the  latter,  is  to  be  useful  to  the  analyst,  all  the  original  information,  including  indica¬ 
tions  of  uncertainty,  unreliability  and  incompleteness,  must  be  faithfully  preserved. 
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Another  important  consideration  when  deciding  upon  a  representation  has  to  ao  with 
the  storage  and  subsequent  retrieval  of  information.  In  the  previous  subsection  we 
presented  a  theory  of  comprehension  which  suggests  that,  when  a  person  thinks  of  a 
particular  entity,  a  number  of  other  closely  related  entities  immediately  come  into 
focus  forming  a  conceptual  whole,  or  a  "chunk"  of  information.  Or,  to  put  it  dif¬ 
ferently,  whenever  a  new  concept  is  introduced  into  a  communication,  that  concept 
introduces  a  number  of  related  concepts  into  the  local  context  or  foreground  ([Chafe 
1972];  [Miller  1978])*  if  our  approach  is  to  be  consonant  with  the  human  way  of 
processing  information,  it  would  seem  that,  when  information  is  needed  about  a  par¬ 
ticular  event,  situation  or  object,  all  of  the  relevant  facts  about  it  should  be 
retrieved  as  one  coherent  package. 

The  structured  object  representations  mentioned  previously  have  a  number  of  pro¬ 
perties  which  render  them  particularly  appropriate  for  the  representation  of  event- 
oriented  data.  As  a  consequence,  they  were  adopted  as  the  basis  for  the  develop¬ 
ment  of  Templates  and  Event  Records,  two  related  types  of  data  structures  which 
play  a  particularly  important  role  in  OSI's  approach  to  the  representation  and  deriva¬ 
tion  of  meaning  from  message  text. 

Templates  and  Event  Records 

Taking  the  event  as  the  primary  unit  of  analysis,  OS!  has  developed  the  concept  of  a 
template  as  an  organizing  principle  for  the  abstract  characterization  of  event 
classes,  their  associated  objects,  and  the  processes  which  act  upon  them. 

The  structure  of  templates  is  based  upon  the  principle  of  "information  chunking" 
mentioned  above.  The  main  function  of  a  template  is  to  group  event-related 
"chunks"  of  knowledge  into  a  single  data  structure  in  a  coherent  and  efficient  way. 
tn  this  sense,  templates  resemble  the  artificial  intelligence  notions  of  prototypes , 
frames ,  scripts  and  schemata. 

Templates  have  two  major  components:  a  descriptive  component,  and  a  procedural 
component.  The  descriptive  components  of  templates  provide  a  framework  for  the 
representation  of  knowledge  about  a  given  subject  domain  approximating  that  which 
a  human  has.  Each  individual  template  contains  invariant  knowledge  about  an  event 
class  (or  a  class  of  objects  associated  with  an  event).  Each  such  class  of  entities 
is  described  from  the  point  of  view  of  the  prospective  analyst-user  community  in 
terms  of  a  set  of  parameters  which  an  analyst  normally  perceives  as  being  associ¬ 
ated  with  that  class  of  entities  at  the  level  of  detail  required  for  the  performance  of 
his  particular  task.  Thus,  a  template  describing  the  class  of  "launch"  events 
includes  parameters  such  as  the  objects  associated  with  the  launch,  the  launch  site, 
the  launch  system  and  the  time  of  the  launch. 

The  procedural  component  of  templates  consists  of  mapping  rules  which  play  an 
active  role  In  the  processes  which  transform  narrative  text  into  event  records. 
Thus,  the  template  for  a  particular  event  class  embodies  the  necessary  procedural 
knowledge  to  construct  a  representation  of  an  individual  event  of  that  class  from  a 
sentence  describing  such  an  event.  An  important  part  of  this  information  consists  of 
what  linguists  call  the  "selectional  restrictions'1  or  "selections!  preferences"  in  a 
particular  domain. 

To  illustrate  this  notion,  consider  Table  l,  which  illustrates  the  kinds  of  information 
incorporated  in  the  "launch"  template,  as  used  for  the  automated  interpretation  of 
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Table  1.  Informal  Description  of  the  LAUNCH  Concept 
In  the  Missile  and  Satellite  Domain. 


Descriptive  Elements 

Procedural  Elements 

Descriptor 

-  - 

Filler  Specification 

-  — 

OBI  or 
OPT 

Procedures  for 
filling  slots 

Agent 

-  _ 1 

if  expressed,  then 
logical  subject  of 
sentence.  Head  noun 
with  feature  NATION 

OPT 

If  conditions  hold, 
fill  Agent  slot  with 
subject  noun  phrase 

Object 

: 

If  no  Agent,  then 

Object  in  logical 
subject  position; 
otherwise  in  object 
position.  Allowable 
features:  MISSILE 
and  SATELLITE 

OBL 

According  to  which 
conditions  hold, 
construct  Object 
template  from  either 
subject  nounphrase  or 
object  nounphrase 

Launchsys 

i 

Either  subject 
nounphrase  with 
headnoun  with 
feature  BOOSTER,  or 

PP  with  prep  BY  and 
headnoun  with  feature 
BOOSTER 

OPT 

Test  headnoun  of  subject 
for  feature  BOOSTER 

Search  VMODS  list 
for  specified 
prepositional  phrase 

|  Launchsite 

PP  with  headnoun 
(+LOC) 

Search  VMODS  list 
for  specified 
constituent 

Inclination 

i 

PP  with  headnoun 
(+INCL) 

Search  VMODS  list 
for  specified 
constituent 

|  Destination 

i 

i 

PP  with  Preps  TO  or 

INTO  and  headnoun 
with  feature  LOC 

Search  VMODS  list 
for  specified 
constituent 

j  Time 

! 

1 

1 

1 .  ADV  (TYME  REF)  or 

2.  PP  with  TYME  prep 
and  headnoun  with 
feature  TYME 

OPT 

Search  VMODS  list 
for  specified 
constituent 

;  Date 

1 

i 

PP  with  DATE-node 

1 

_ i 

OPT 

Search  VMODS  list 
for  specified 
constituent 

message  text  describing  missile  or  satellite  launch  events. 

The  entities  associated  with  the  central  concept,  i.a.,  the  agent  who  launched  the 
object,  the  object  itself,  the  orbit,  the  launch  system,  the  launch  site,  etc.,  are 
represented  by  means  of  template  ‘•descriptors".  The  latter  roughly  correspond  to 
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the  semantic  “cases*  of  linguistic  deep  cast*  theory  ([Fillmore  1968,  1971];  [Chafe 
1970]).  Cases,  in  the  linguistic  sense,  are  the  relations  which  connect  nominal  con¬ 
cepts  to  verbs.  The  "case**  ralatlons  used  in  the  work  described  here  are  derived 
from  a  representative  sample  of  message  traffic  by  means  of  an  In-depth  "intellec¬ 
tual-  analysis.  They  name  the  relations  which  hold  between  the  various  components 
of  a  complex  entity  in  a  given  task  domain  in  terms  familiar  to  the  analyst.  The 
cases  therefore  differ  somewhat  from  those  proposed  by  Fillmore  or  Chafe,  in  that 
:hev  are  more  specific  to  the  knowledge  domain  of  interest.  A  case  analysis,  as 
-.fleeted  in  a  template,  therefore,  is  compatible  with  the  intellectual  analysis  an 
analyst  would  have  arrived  at. 

Table  2  illustrates  the  notion  of  an  event  record,  which  is  the  data  structure  used  fer 
the  description  of  specific  events.  The  information  stored  in  Table  2  represents  the 
event  described  in  sentence  (a)  below. 

Sentence  (a): 

ON  14  MAY  1973,  THE  SKYLA8  ORBITAL  WORKSHOP 
WAS  FIRED  FROM  THE  KENNEDY  SPACE  CENTER  BY  A 
SATURN-6  TYPE  LAUNCHER. 

Table  2.  Event  Record  representing  the  reformation 
contained  in  Sentence  (a) 


Event:  launch 
Agent*agentunk 
Object:satelllte 

Equipment*skylab  orbital  workshop 
Quantity*  one 

Launchsys*by  a  saturn-5  type  launcher 
launchsite*from  :he  kennedy  space  center 
Time* 

Date*on  14  may  1973 


Ev^nt  records  are  related  to  templates  as  the  description  of  an  individual  »s  related 
to  the  description  of  its  class.  In  other  words,  each  event  record  describes  one 
individual ,  i.e.,  a  unique  member  of  a  class  of  individuals  in  the  world  being  modeled 

Thus,  while  the  launch  template  characterizes  a  class  of  events  in  a  general  and 
abstract  way,  the  event  record  representing  the  information  in  sentence  (a) 
describes  a  specific  launch  event  involving  a  particular  spacecraft  (e.g.,  Skylab), 
launched  from  a  particular  launch  site  (the  Kennedy  Space  Center),  by  a  particular 
launch  system  (a  Saturn -6  type  launch  vehicle),  on  a  particular  date  (14  May  '973). 
In  the  terminology  of  logic  one  would  say  that  the  reiatfon  between  a  template  and 
its  corresponding  event  records  is  roughly  the  same  as  that  which  holds  between  an 
intensions!  description  of  a  concept  and  an  element  of  its  extension.  Thus,  the  set 
of  event  records  describing  events  of  the  same  class,  i.e.,  event  records  related  to 
a  particular  template,  constitutes  the  extension  of  the  concept  described  by  the 
template. 
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Event  ac-ords  ure  derived  from  an  input  text  oy  pi ocedur^s.  a^acnou  to  individual 
descriptor  iiots.  These  procedures  metcn  the  nput  against  the  appropriate  template 
*n  an  ai tempt  to  “fill  m44  text-derived  values  for  the  descriptors  or  the  currently 
activated  template.  Consequently,  event  records,  like  templates,  are  event- 
centered  structures  in  which  the  information  conveyed  by  the  input  can  be  viewed 
from  the  perspective  or  time,  location,  type  of  activity,  object(s)  involved,  elc.  They 
provide  content  representations  for  individual  sentences  describing  atomic  events, 
and  form  the  building  blocks  of  message  content  representations  which,  in  rum 
underlie  data  base  element  synthesis. 

*^»iations  between  the  various  entitles  associated  with  an  event,  i.e..  the  set  of  puj- 
tions  connecting  descriptors  to  the  main  concept  within  a  template,  are  referr  to 
as  intra-tempiate  relations.  Intra-template  relations  can  be  used  to  alert  the  analyst 
to  missing  ^formation.  To  see  this,  consider  the  event  record  for  sentence  (a).  For 
the  event  record  to  be  complete,  a  value  must  be  found  tor  all  the  descriptor  slots  of 
its  corresponding  template.  Notice,  however,  that  the  message  fragment 
represented  by  sentence  (a)  is  Incomplete  in  terms  of  the  "launch1*  template  specifi¬ 
cation  outlined  m  rable  1.  Conspicuously  absent  are  values  for  the  Agent  and  Time 
descriptors.  Since  sentence  fa)  contained  no  explicit  mention  of  who  launched  the 
spacecraft,  but  the  sentence  was  passive,  the  Agent  slot  was  given  the  default 
vaiw«  of  iger*unk"  \ agent  unknown)  by  the  system.  The  Time  sic*,  on  the  other 
hand,  was  eft  empty 

The  concept  of  Time,  because  of  its  complexity  in  the  intelligence  environment,  also 
requires  a  separate  template  for  its  abstract  description.  Time  is  not  always  expli¬ 
citly  stated  m  a  message.  It  and  must  often  be  computed  from  the  tense  of  the 
verbs  occurring  n  the  message,  and  such  complex  natural  language  expressions  as 
"AT  10  MINUTE  INTERVALS  FOR  A  PERIOD  OF  6  HOURS'*.  Moreover,  time  operators 
such  an  ‘currently41,  require  reference  to  information  in  the  message  header  or  the 
message  ext  for  :hetr  resolution.  The  absence  of  any  expected  element  in  an 
event  'ecord  can  be  used  to  inform  the  analyst  that  some  description  of  an  object  or 
e^ent  »s  incomplete. 

While  the  >ntr  attempt  ate  relations  described  above  define  the  internal  composition  of 
concepts,  mter -template  relations  specify  the  relationships  between  concepts, 
inter  concept  relations  are  also  represented  by  templates.  For  example,  the  temporal 
relation  of  precedence  (oMen  expressed  in  language  as  a  prepositional  phrase  with 
"he  preposition  "before14)  s  defined  as  a  template  which  links  two  event  h>»  ^s 
event  type  A  normally  takes  place  before  event  type  B,  where  each  event  type  .> 
tseif  represented  by  a  template. 

Note  that,  as  opposed  to  intra-template  relations,  inter-template  relations  ire  ^or- 
stramed  0/  the  physical  laws  and  normal  expectations  of  the  world  being  modeled 
Examples  of  inter-template  relations  are  causal  relations,  temporal  succession  rela¬ 
te  uoa,  ana  3tner  impilcational  relations  between  events.  A  spacecraft  which  is  suc¬ 
cessfully  launched  will  normally  attain  orbit,  remain  in  orbit  for  a  certain  number  of 
revolutions,  and  subsequently  deorbit,  reenter  the  earth's  atmosphere,  break  jp,  burn 
up  or  crash  to  earth,  in  which  case  it  can  be  recovered,  inter-template  relations,  as 
ntr a  template  relations,  can  be  used  in  a  predictive  capacity. 

To  sum  up,  templates  are  models  of  objects,  events,  and  situations  habitually 
encountered  m  the  domain  of  application.  They  provide  the  means  of  coding  the 

2-8 


r 


analyst’s  knowledge  ot  his  task  domain  ,n  terms  of  logical  structures  which  are  easily 
transformed  into  representations  susceptible  to  automatic  processing.  They  are 
active  memory  structures  which  embody  hypotheses  about  facts,  processes,  opera¬ 
tions,  procedures  and  computations  required  to  characterize  the  links  between  'arm 
and  content.  Procedurally,  they  can  best  be  described  as  the  fundamental 
knowledge  structures  which  mediate  the  correlations  between  syntactic  structures 
and  their  corresponding  information  content. 

Templates  can  be  helpful  in  discovering  inconsistencies  both  »n  the  input  and  »n  the 
system’s  model  of  the  world.  If  information  derived  from  the  input  cannot  be  fitted 
icto  one  of  the  templates  stored  in  memory,  it  may  mean  that  there  are  ir.consisten- 
c*qs.  For  example,  finding  that  a  missile  is  reported  as  being  in  orbit,  is  inconsistent 
with  the  expected  actions  a  missile  is  designed  to  perform.  On  the  other  hand,  "he 
Inability  ot  the  system  to  account  for  the  input  may  also  mean  that  the  structure  of 
the  model  is  incomplete:  either  a  component  is  missing,  or  that  this  is  a  possible  but 
atypical  situation  for  which  there  is  no  representation  in  the  system. 

Each  subject  domain  yields  its  own  template  inventory  corresponding  to  the  events 
and  objects  and  their  intemaJ  and  external  relations  —  optional  or  obligator/  —  winch 
have  informational  significance  within  that  domain,  in  the  Miss.le  and  Satellite 
domain,  some  of  the  key  concepts  for  which  templates  have  been  constructed  are: 
“launch11,  “deorbit1*,  “reentry11,  “breakup'1,  “impact*1,  “missile*1,  “satellite*’,  and  “DIG" 
(date/time  group). 

Event  records,  on  the  other  hand,  contain  descrp  Jons  of  individual  events,  objects, 
and  their  properties.  They  have  several  important  properties  which  render  them  par¬ 
ticularly  useful  as  a  support  tool  for  the  l&W  analyst: 

e  They  present  information  to  the  analyst  in  the  form  of  a  structure  which  is 
compatible  with  his  view  of  the  world  and  are  therefore  easier  to  read  and  dig¬ 
est  than  unstructured  text.  It  is  suggested  that  this  should  maximize  the 
analyst’s  ability  to  monitor  and  assimilate  incoming  data. 

e  They  are  usable  for  the  construction  of  an  analyst’s  data  base,  which  allows 
flexible  retrieval  of  information  not  only  by  event  type,  but  also  by  other  asso¬ 
ciated  parameters,  such  as  object(s)  nvolved  in  the  event  and  time  and  loca¬ 
tion  indicators. 

e  The  information  stored  in  these  data  structures  lends  itself  readily  to  further 
processing.  This  processing  may  be  statistical  in  nature,  or  may  be  part  of  the 
general  inference  making  mechanisms  of  the  message  data  base. 

It  is  our  contention  that  adopting  templates  and  avent  records  as  representational 
constructs  in  ths  data  acquisition  component  of  a  message  data  base  has  significant 
advantages  over  the  use  of  production  rules,  and  can  lead  to  powerful,  effective, 
end  efficient  systems. 

Aa  explained  earlier,  templates  represent  the  system’s  current  hypothesis  about  how 
to  interpret  incoming  message  data.  The  procedures  associated  with  template  slots 
are  executed  for  the  purpose  of  verifying  the  current  hypothesis.  Execution  of  pro¬ 
cedures  is  thus  strongly  constrained  by  the  currently  activated  template,  n  a  pro¬ 
duction  system  without  such  structures,  rules  are  generally  executed  in  a  mode 
which  may  cause  irrelevant  rules  to  be  tried.  A  model-based  system,  therefore,  is 
potentially  faster  because  needless  rule  invocation  tends  to  be  minimized. 
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3.0  MESSAGE  PROCESSING  IN  THE  MATRES  III  SYSTEM 


The  principles  and  data  structures  discussed  above  are  implemented  in  the  MATRES 
Message  Analysis  System,  which  specifically  addresses  the  problem  of  automating 
the  analysis  of  the  narrative  text  portions  of  intelligence  messages  describing 
events  in  the  Air  Activities  and  Missile  and  Satellites  domains.  The  analytical  pro¬ 
cedures  employed  by  MATRES  automatically  identify  the  topics  and  entities  referred 
to  in  a  text,  including  the  meaning  relations  which  hold  among  those  entitles.  In  the 
process,  they  transform  verbose  message  text  into  succinct,  computer-processabie 
content  representations,  by  performing  a  lexical,  syntactic,  semantic,  and  pragmatic 
analysis  of  the  text.  The  outputs  of  this  analysis  are  the  “event  records"  described 
in  the  previous  section.  They  serve  as  a  basis  for  the  construction  of  the  analyst's 
data  base. 

The  analytical  procedures  of  MATRES  utilize  two  major  knowledge  sources:  (1)  a 
model  of  the  sublanguage  that  characterizes  the  domain  of  application  and  (2),  a 
model  of  the  entities  and  relations  characteristic  of  this  domain. 

The  term  sublanguage  is  used  to  refer  to  the  specialized  usage  of  English  in  a  partic¬ 
ular  domain  of  discourse.  The  sublanguages  used  in  message  traffic  differ  from  gen¬ 
eral  English  in  several  respects.  First,  they  are  restricted,  in  the  sense  that  they 
normally  use  a  small  and  specialized  vocabulary.  Second,  they  consist  only  of 
declarative  sentences  (or  fragments  thereof);  there  are  no  questions  or  commands. 
Furthermore,  such  reporting  languages  are  usually  characterized  by  heavy  use  of 
abbreviations,  and  often  employ  grammatical  constructions  which  deviate  from  those 
of  “normal*1  English,  (e.g.,  dropped  articles,  dropped  prepositions,  and  idiosyncratic 
usage  of  words.  As  an  example  of  idiosyncratic  vocabulary  usage,  consider  the  verb 
“deploy".  In  ordinary  discursive  prose,  this  verb  is  normally  used  with  an  animate 
agent:  somebody  deploys  something.  In  messages  reporting  on  aircraft  activities, 
aircraft,  although  inanimate,  and  therefore  not  “agentive11  in  the  strict  sense  of  the 
word,  deploy  themselves.  Also,  sentences  are  mostly  passive;  constructions  with  the 
agent  expressed  in  surface  structure  occur  infrequently.  The  differences  are  often 
so  marked,  that  it  turns  out  to  be  more  convenient  to  write  specialized  grammars  for 
these  sublanguages,  than  it  would  be  to  adapt  a  grammar  written  for  “normal'* 
English.  This  was,  as  will  become  apparent  later,  to  a  certain  extent  the  case  in  the 
work  described  here. 

The  basic  unit  of  analysis  adopted  for  this  work  is  the  “Event  Report1',  which  is 
described  in  detail  below. 

The  two  subject  domains  which  have  served  as  a  basis  for  the  development  of 
MATRES  are  aircraft  activities  and  missile  and  satellite  events.  Although  the 
automated  database  generation  methodology  incorporated  in  MATRES  was  developed 
on  the  basis  of  the  somewhat  restricted  subject  domains  just  mentioned,  it  is 
presumed  to  be  extensible  to  applications  of  greater  scope  than  the  current  one. 

3,1  Current  Domain  of  Application* 

The  current  domain  of  application  of  MATRES  covers  a  type  of  text  which  describes 
observable  events  involving  objects  such  as  aircraft,  missiles,  satellites  and  space¬ 
craft,  whose  activities  normally  fail  into  predictable  patterns. 

Of  fundamental  importance  in  this  work  Is  the  notion  of  EVENT ,  which  is  defined  as 
the  aggregate  of  all  the  states,  processes,  and  actions  associated  with  an  object  or 
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a  set  of  objects  from  the  inception  to  the  termination  of  some  larger  activity.  To 
illustrate  this  notion,  consider  the  following  EVENT  REPORT,  which  describes  the 
launch  of  Skylab  and  the  various  actions  it  was  involved  in  from  the  time  it  was 
launched  to  the  time  it  reentered  the  earth’s  atmosphere  and  impacted  in  Western 
Australia. 

Skylab  was  launched  from  the  Kennedy  Space  Center  at  approximately 
1  pm  on  May  14th,  1973.  A  Saturn-5  type  launch  vehicle  was  used  to 
inject  the  spacecraft  into  an  orbit  inclined  at  50  degrees  to  the  equa¬ 
tor.  Skylab  incurred  serious  damage  at  lift-off  time  and  was  later 
repaired  by  the  astronauts  previously  scheduled  to  rendezvous  with  it. 

The  astronauts,  launched  in  a  modified  Apollo  service  module,  attached 
themselves  to  Skylab  by  “umbilical*1  cords  (connections  to  life  support 
systems),  and  salvaged  the  ailing  spacecraft.  Subsequently,  Skylab 
was  used  for  experiments,  and  was  later  abandoned.  Recently,  it 
caused  much  concern,  because  it  was  obvious  Skylab  was  going  to 
deorbit  and  crash  to  earth.  It  did  indeed  reenter  the  earth’s  atmo¬ 
sphere,  upon  which  it  began  to  break  up  into  pieces,  some  of  which 
burned  up  on  reentry.  Others  impacted  in  the  Indian  Ocean,  and  yet 
others  landed  in  Western  Australia  and  were  recovered. 

The  text  of  this  EVENT  REPORT  was  artificially  composed  on  the  basis  of  articles 
which  appeared  in  the  Los  Angeles  Times,  Newsweek,  and  Aviation  Week,  between 
1973  and  1979.  Normally,  EVENT  REPORTS  comprise  several  intelligence  messages 
received  over  a  period  of  time,  rather  than  a  single  paragraph. 

Events  of  the  kind  described  above  usually  have  a  complex  internal  structure  com¬ 
posed  of  smaller  interrelated  units  of  action  —  in  this  case  the  launch,  the  deorbit, 
the  impact,  etc.  These  smaller  units  are  referred  to  as  atomic  events.  Atomic  events 
are  the  basic  units  of  description  at  the  event  level.  An  atomic  event  cannot  have 
another  event  as  a  component.  The  intemai  structure  of  atomic  events  reflects  the 
participants  n  the  event  and  their  relations  to  the  central  action. 

3.2  Prolog  Representation  of  Templates  and  Event  Records. 

Our  current  domain  of  discourse  is  chsractenzed  by  aircraft,  missiles  and  satellites, 
their  respective  properties,  the  purposes  for  which  they  can  be  used,  the  actions 
that  can  be  performed  on  them,  the  actions  in  which  they  can  participate,  their  par¬ 
ticular  role  in  each  event  type,  and  the  relations  between  such  events  in  the  con¬ 
text  of  a  larger,  coherent  activity. 

To  model  the  objects,  event  classes  and  relationships  of  our  domain,  we  defined  a 
set  of  templates  which  embody  the  system’s  knowledge  of  the  possible  objects, 
events  and  relationships  in  the  domain  of  application.  At  the  same  time,  they  contain 
the  procedural  knowledge  necessary  for  mapping  input  sentences  onto  data  struc¬ 
tures  called  event  records . 

in  our  system,  templates  are  implemented  as  Prolog  procedures,  while  event  records 
take  the  form  of  Prolog  “terms'*.  As  mentioned  above,  the  template  for  a  particular 
ciass  of  atomic  event  embodies  the  necessary  information  to  construct  an  event 
'ecord  from  a  sentence  describing  an  event  of  that  class.  Thus,  the  template  shown 
m  Time  1,  when  applied  to  sentence  (1),  yields  the  event  record  shown  in  Table  2. 
Each  event  record  names  the  atomic  event  represented,  and  records  those 


(1)  The  Skylab  orbital  workshop,  a  converted 
S-4B  third  stage  from  a  Saturn-5  launch 
vehicle,  successfully  deoroited  into 
the  Australian  outback  on  12  Jul  19  79, 


Table  1.  DEORBIT  Template 


Table  2.  Event  Record  for  Sentence  (1) 

Status=successfully 
Event:  deorbit 
0bject:satelllt*3 

Equipment=skylab  orbital  workshop 
Quantity3  one 

Add_ Attributes-  a  converted  s-4b  third  stage  j 
from  a  saturn-5  launch  vehicle  j 

location-into  the  australian  outback 
Revolution3 
Time3 

Oate3on  12  July  1979 

properties  which  were  explicitly  expressed  in  the  input  sentence. 

To  achieve  a  content  representation  for  an  entire  Event  description,  a  higher-level 
template  for  the  abstract  representation  of  larger  event  types  was  defined.  Such  a 
template  represents  characte  istic  patterns  of  atomic  events  in  a  given  domain  of 
application.  When  applied  to  a  coherent  text,  it  constructs  a  meaning  representation 
reflecting  the  conceptual  organization  of  the  text  as  a  whole. 

Table  3  shows  a  possible  Prolog  representation  of  a  higher-level  template,  which, 
when  implemented,  will  serve  to  derive  meaning  representations  from  a  class  of 
event  descriptions  resembling  the  Skyfab  text. 

The  following  section  illustrates  the  processes  involved  in  the  computer  ”under- 
standing"  of  message  text. 

3.3  Principles  of  Message  Text  Analysis. 

From  the  theoretical  viewpoint,  the  primary  goal  of  natural  language  understanding  is 
to  arrive  at  the  fora/  meaning  content  of  a  text.  This  is  usually  taken  to  include  all 


Table  3  Prolog  Representation  of  SPACECRAFT  ACTIVITY  Template 


assemble  (spacecraft.,  act:  vi  ty,  [list  of  event  records], 

[connected  discourse  structure] 

launch  { . ) , 

achieve^orb: c  ;; . ' , 

deorbi  1 1 . ) , 

reentry  ; . '  , 

breakup  c . ) , 

impact  . ^ , 

recover  t . ) . 


the  tacts  *hich  are  explicitly  recorded  plus  those  facts  which  can  be  inferred  from 
the  meaning  of  the  words  appearing  in  that  text  and  their  syntactic  and  semantic 
interrelations*  Such  a  goal  may  lead  to  meaning  representations  which  are  overbur¬ 
dened  with  detail,  which  wcuia  oe  irrelevant  in  the  context  of  this  work. 

A  human  anaiyst  s  selective.  He  does  not  seek  to  extract  a//  the  information  a  text 
may  contarn,  but  only  that  which  is  is  needed  for  the  performance  of  his  task.  The 
goai  of  this  work,  then,  ts  to  disregard  irrelevant  detail;  message  text  Is  transformed 
Into  content  structures  reflecting  the  analyst's  view  of  the  world. 

The  logical  unit  of  analysis  adopted  as  a  basis  for  describing  intelligence  information 
is  the  EVENT  REPORT. 

The  0S1  natural  language  processing  system  (MATRES)  is  based  upon  a  process 
model  of  text  understanding  involving  four  sets  of  operations. 

First,  the  sentences  of  a  message  text  are  parsed  into  a  set  of  propositional  struc¬ 
tures.  The  propositions  are  linked  by  various  semantic  relations  which  may  be  expli¬ 
citly  expressed  in  the  surface  structure  of  the  text,  or  inferred  during  the  interpre¬ 
tation  process  on  t*he  basis  of  contextual  and/or  real  world  knowledge. 

Second,  the  resulting  set  of  propositions  are  organized  into  higher-level  conceptual 
categories,  namely*  event  representations. 

The  first  two  operations  described  above  are  implemented  in  the  current  version  of 
OSI’s  message  text  analysis  system  MATRES*  and  will  be  described  in  detail  in  the 
next  section. 

A  third  set  of  operations  links  the  resulting  event  representations  into  a  coherent 
whole,  reflecting  the  meaning  of  the  message  text  as  a  whole. 

Finally,  when  all  messages  constituting  an  EVENT  REPORT  are  processed  in  this 
manner,  a  set  of  constraints  checks  the  coherence  of  the  set  of  messages  consti¬ 
tuting  an  EVENT  REPORT  at  the  global  level,  i.e  ,  at  the  level  of  the  EVENT  REPORT. 

Many  of  the  subprocesses  involved  in  language  understanding  are  still  largely  unex¬ 
plored  and  it  is  therefore  not  possible  to  construct  a  comprehensive  model  of 
languaqe  understanding.  For  example,  no  complete  specification  for  the  many  and 
complex  inferential  processes  involved  in  language  understanding  can  be  given  at 
this  stage,  although  work  in  progress  in  fhe  fields  of  Artificial  'nteiligence  and 
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Cognitive  Science  is  very  promising. 

3.4  Analytical  Processes. 

3.4.7  Sentence- Level  Analysis  In  the  current  implementation  of  OSI's  message  text 
analysis  system  (MATRES),  the  understanding  process  begins  when  a  sentence  of 
the  message  text  is  input  to  the  system.  This  can  be  done  either  from  a  terminal,  or 
from  a  disk  file.  As  a  first  step  in  the  analysis,  each  sentence  of  an  input  text  is 
parsed  by  a  syntactic  processor  implemented  directly  as  a  Definite  Clause  Grammar 
(DCG)  [Pereira  and  Warren  1980].* 

The  currant  scope  of  the  MATRES  sentence  grammar  covers  declarative  sentences 
with  simple  and  complex  noun  phrases,  sentential  and  infinitival  complements,  and  a 
large  range  of  adverbials.  The  grammar  currently  comprises  80  DCG  rules  of  the  form 

nt  — >  body. 

where  nt  is  a  non-terminal  symbol  with  one  or  more  arguments,  and  body  is  a 
sequence  of  one  or  more  goa/s  separated  by  commas.  This  is  illustrated  by  the  rule 
below,  which  builds  the  parse  tree  (S)  of  an  input  sentence. 

sentence  (S)  — > 

/*  preposed  adverbials  */  adverbials (PreAdv)  , 

/*  grammatical  subject  */  noun_phrase (G_Subj) , 

/*  evaluative  adverb  (e.g. ,  probably)  */  evaladvb (Evaladvb) , 
verb  (Verb,  Tense) , 

rest.sentence  (PreAdv,  G_Subj,  Evaladvb,  Verb,  Tense,  S) ,  {!  }  . 

The  associated  lexicon  takes  the  form  of  Prolog  clauses.  Examples  of  lexical  entries 

are: 

adj  (central) . 
noun  (satellite) . 

verbform (launched,  launch,  past). 

The  output  of  this  stage  is  a  shallow  parse  tree  with  mostly  standard  grammatical 
categories  (s,  np,  pp,  n,  v,  adv,  etc.)  at  nodes.  The  grammar  also  contains  a  few 
nonstandard  definitions  of  syntactic  categories  which  reflect  idiosyncratic  usage  of 
language  in  our  particular  domain  of  application,  and  were  introduced  to  expedite  the 
parsing.  Their  main  function  is  to  help  avoid  false  parsing  paths.  Some  nonstandard 
phrasal  categories  (e.g.,  tm  for  "time  phrase'1,  and  mp  for  "measure  phrase"  )  can  be 
observed  in  Table  4b,  which  displays  the  internal  representation  of  a  sentence  after 
syntactic  analysis. 

The  parser  is  currently  constrained  so  as  to  provide  only  one  analysis  per  sentence. 
This  constraint  was  implemented  by  using  the  Prolog  "cut  symbol",  which  restrains 
the  sentence  grammar  procedure  from  returning  any  but  the  first  parse;  the  grammar 
itself  is  capable  of  returning  multiple  parses. 


*  The  Definite  Clause  Grammar  formalism  described  in  Pereira  at  al.  is  based  upon 
Cotmerauer’s  Metamorphosis  Grammar ,  the  original  grammar  formalism  for  natural 
language  analysis  using  definite  clauses  [Coimerauer  1975],  We  chose  the  DCG 
formalism  as  a  model  for  this  work,  because  of  its  close  relation  to  Prolog  and  its 
expressive  power,  as  described  by  Pereira  and  Warren. 
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3.4.2  From  Parsed  Sentences  to  Atomic  Events  The  next  step  in  the  analysis  of  a 
sentence  is  to  derive  its  meaning  in  terms  of  atomic  events .  where  an  atomic  event 
corresponds  to  a  simple  proposition  referring  to  an  action  which  changes  the  prevail¬ 
ing  state  of  affairs.  To  achieve  this  objective,  the  program  selects  and  activates 
the  appropriate  template  on  the  basis  of  certain  characteristics  stored  in  the  lexical 
entry  of  the  main  verb  of  that  sentence.  Control  now  passes  to  the  template,  which 
actively  seeks  "fillers"  for  its  "slots"  to  construct  a  content  representation  in  the 
form  of  one  or  more  event  records.  These  "fillers"  are  selected  by  th«-  syntactic 
analyzer  on  the  basis  of  syntactic,  semantic  and  pragmatic  constraints  on  the  consti¬ 
tuents  of  the  parse  tree  output. 

The  interpretive  component  of  MATRES  consists  of  a  module  for  syntactic  normals:  .1- 
tlon,  seven  event  templates  and  their  associated  procedures,  two  object  templates, 
and  a  template  for  the  date/time  concept.  If  one  includes  the  top-level  controlling 
procedures  which  initiate  the  semantic  interpretation  of  a  parse  tree,  the  total 
number  of  Prolog  clauses  used  by  the  interpretive  module  is  close  to  1  00 

An  Illustrative  Example 

The  following  is  an  example  of  how  our  programs  interpret  the  complex  sentence  (4) 
which  describes  an  expected  deorbit  event.  The  internal  representations  shown  are 
explained  in  full  detail  in  the  program  documentation  [Dwiggins  and  Silva  1981]. 

(4)  The  two  81  degree  probable  second  generation  satellites, 
which  were  launched  from  the  Kennedy  Space  Center 
earlier  today,  were  expected  to  deorbit  over  Western  Australia 
Just  northwest  of  Kalgoortie  at  1 330  hours  this  date. 


Table  4a.  Internal  representation  of  sentence  (4)  after  being  read  in. 

[the,two,nb(31, 2), degree,probable, third, generation, satellites,,, 
whlch,were, launched, from, the, kennedy, space, center, earlier,  today,,, 
were, expected,to,deorbit,over, western, australia, 

Just,northwest,of,kalgooriie,  at,nb(1 330, 4), hours, this, date,.] 

The  fist  of  words  shown  in  Table  4a  constitutes  the  input  to  the  parser.  In  our  sys¬ 
tem,  sentences  have  seven  immediate  constituents:  tense,  voice,  logical  subject, 
verbgroup,  logical  object,  complement,  and  a  list  of  adverbials.  Table  4b  shows  the 
syntactic  structure  assigned  to  sentence  (4)  by  the  parser. 

Notice  that  the  parser  converts  the  passive  input  sentence  of  Table  4a  into  its 
active  form  and  supplies  the  word  "agentunk"  (agent  unknown)  for  the  missing  sub¬ 
ject  of  "expect".  The  same  procedure  is  applied  to  the  relative  clause,  "which  were 
launched  from  the  Kennedy  Space  Center  earlier  today".  Furthermore,  clauses  with 
"extraposed"  subjects,  such  as  the  infinitival  complement  "to  deorbit  over  Western 
Australia....",  are  also  assigned  propositional  structure.  Accordingly,  in  the  analysis  of 
sentence  (4),  the  entire  complex  noun  phrase  describing  the  two  satellites  in  ques¬ 
tion  is  copied  to  become  the  logical  subject  of  the  verb  "deorbit".  There  are,  how¬ 
ever,  cases  of  extraposition  which  are  not  easily  handled  by  a  DCG  grammar.  For  a 
discussion  of  extensions  to  the  DCG  formalism  specifically  designed  to  handle  cases 
of  extraposition  in  a  general  way,  see  [Pereira  1  980]. 
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Table  4b.  Internal  representation  of  sentence  (4)  after  syntactic  analysis 


s( 

tns(past), 

voice(psv), 

nP([]»[].nn°de(a9entunk,[]),[]), 
vg([], expect), 

np(dp([],the,nbr([],two)),[mp(nbr([],nb(81 , 2)),  u(  degree,  angular)), 
probable, third, generation], nnode(sateilites,[]), 

[s( 

tns(past), 

voice(psv), 

nP([].[]»nnode(agentunk,[]),[]), 

vg([]»launch), 

np(dp([],the1nbr([],two)),[mp(nbr([],nb(8t  ,2)  ),u(  degree, angular)), 
probable, third, generation],nnode(satellites, []),[]),[], 
[pp([],from,np(dp([],the,[]),[kennedy,space],nnode(center,[]),[])), 
earlier, today])]), 
s( 

tns(present), 

voice(act), 

np(dp([],the,nbr([],two)),[mp(nbr([],nb(81 , 2)), u(degree, angular)), 
probabie,third,ger/eratlon],nnode(satelIltes,[j), 

[s( 

tns(past), 

voice(psv), 

nP([]»[]>nnode(a9entunk, []),[]), 
vg([], launch), 

np(dp([],the,nbr([],two)),[mp(nbr([],nb(81 , 2)), u(degree, angular)), 
probable, third, generation]Jnnode(satel!ltes, []),[]),[], 
[PP([]ifrom,np(dp([],the,[]),[kennedy, space], nnode(center, []),[])), 
earlier, today])]), 
vg([],deorbit),[],[], 

[PP([].over,np(dp([], [],[]), [western], nnode(australia, []),[])), 
PP([just],northwest  of, np(dp([],[],[]),[],nnode(kalgoorlie, []),[])), 
tm(at,mp(nbr<[]inb( 1  330, 4)), hours)), 
np(dp([],  this, []),[],nnode(date,[]), [])]),[]) 

There  are  a  number  of  points  worth  noting  regarding  the  interpretation  of  the  various 
syntactic  components  of  sentence  (4), 

First,  the  higher  predicate  expect  is  interpreted  as  an  event  status  indicator,  and  as 
such  is  accepted  as  an  appropriate  filler  for  Status  operator  modifying  the  deorbit 
template  (see  Table  4c).  Other  verbs  indicating  status  are  fa/7,  and  continue .  Verbs 
of  communication  taking  sentential  complements  are  interpreted  as  signaling  the 
source  of  the  report,  while  "speaker-oriented11  adverbs  such  as  "successfully”, 
"probably”  and  "possibly”,  contribute  to  the  "certainty”  of  the  entity  with  which  they 
are  associated. 


Table  4c.  Event  Record  for  sentence(4) 


Status=expect 
Event;  deorbit 
Objectrsatellite 

Equipment=81  degree  probable  third  generation  satellites 

Quantity=two 

Rel_  events 

Event;  launch 
Agent=agentunk 
Objectisatellite 

Equipment=81  degree  probable  third 
generation  satellites 
Quantity=two 

launchsite=from  the  kennedy  space  center 
Destinations' 

Time*earlier 

Date-today 

Locatlon=over  western  australia  just  northwest  of  kalgocrlie 
Revolution^ 

Tlme=at  1  330  hours 
Oate-this  date 

Second,  both  the  relative  clause  embedded  in  the  subject  noun  phrase  ("which  were 
launched  from  the  kennedy  Space  Center")  and  the  infinitival  complement  (“to  deor¬ 
bit  over  Western  Australia  just  northeast  of  Kalgooriie  at  1300  hours  this  date") 
describe  atomic  events  in  the  sense  defined  previously,  and  are  therefore 
transformed  into  a  event  records.  The  “deorbit"  event  described  by  the  infinitival 
complement  is  Interpreted  as  the  main  event  reported  in  the  sentence.  The  “launch" 
event  described  by  the  relative  clause,  on  the  other  hand,  refers  to  a  previously 
reported  event,  and  is  interpreted  as  a  related  event.  It  becomes  the  value  of  the 
event  attribute  of  the  Object  template.  Relative  clauses  which  do  not  describe 
events,  but  represent  additional  properties  of  the  head  noun  are  interpreted  as  Other 
Attributes .  Since  such  attributes  were  not  present  in  the  sentence  under  discussion, 
they  do  not  appear  in  its  event  record  (Table  4c). 

This  concludes  our  brief  sketch  of  the  processes  involved  in  the  analysis  of  indepen¬ 
dent  sentences.  These  processes  are  repeated  until  all  sentences  of  the  text  are 
transformed  into  event  records.  From  the  above,  it  can  be  seen  that  the  number  of 
event  records  constructed  for  a  particular  sentence  depends  on  the  number  of 
atomic  events  described  by  that  sentence.  Thus,  while  simple  sentences  yield  one 
event  record ,  complex  sentences  yield  event  records  in  which  some  of  the  slots 
contain  other  event  records. 

As  the  sentences  constituting  an  event  description  are  processed  in  the  manner 
described  above,  the  resulting  event  records  are  concatenated  into  a  list.  The  latter 
constitutes  the  output  of  the  sentence-level  analysis,  and  also  serves  as  input  to 
the  text-level  interpretive  procedures. 
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3.4,3  Beyond  the  Atomic  Event  At  this  point,  a  higher-level  template  (not  yet  imple¬ 
mented)  is  activated.  It  takes  as  input  the  list  of  event  records  derived  by  the  pre¬ 
viously  described  processes,  and  assembles  tnem  into  a  connectec  discourse  struc¬ 
ture  reflecting  the  meaning  of  the  text  as  a  whole.  A  first  approximation  to  such  a 
template  was  shown  in  Table  3. 

It  is  at  this  level  that  some  anaphoric  references  are  resolved,  and  temporal,  causal, 
and  other  text-level  relations  between  atomic  events  are  established.  It  ts  impor¬ 
tant  to  note,  however,  that  not  all  types  of  anaphora  can  be  resolved  wtth  currently 
known  techniques. 

The  Prolog  formalism  lends  itself  well  to  the  expression  of  certain  text-level  rela¬ 
tions.  For  example,  returning  to  Table  3,  observe  that  the  goals  in  the  body  of  the 
"assemble"  clause  embodying  the  "spacecraft  activity"  template  are  ordered  so  as 
to  reflect  the  physical  laws  governing  the  permissible  succession  of  atomic  events 
within  the  larger  event  class  represented  by  that  clause.  Thus,  the  goal  ordering  in 
Table  3  reflects  the  constraint  that  a  spacecraft  can  only  achieve  orbit  after  it  has 
been  launched,  and  thereafter  must  deorbit  before  it  can  crash  to  earth  and  impact. 
Moreover,  by  making  some  of  the  goals  optional,  we  can  express  two  important  facts. 
One:  not  all  atomic  events  conceptually  associated  with  a  larger  event  need  neces¬ 
sarily  take  place,  as  a  deorbited  satellite  need  not  necessarily  impact;  it  can  burn  up 
upon  reentry  and  never  crash  to  earth.  Two:  not  all  actions  performed  by  a  space¬ 
craft  need  be  explicitly  recorded  in  a  text  reporting  on  the  activities  of  that  space¬ 
craft:  a  first  report  may  mention  a  launching,  and  i  subsequent  one  an  impact.  The 
deorbit,  which,  according  to  the  physical  laws  governing  this  type  of  event  must 
have  taken  place,  is  implied. 

It  is  important  to  note  at  this  point  that  modeling  the  permissible  temporal  sequence 
of  atomic  events  is  only  a  first  step  in  achieving  an  adequate  representation  of 
discourse  structure.  A  great  deal  more  needs  to  be  done  if  ail  relevant  information  is 
to  be  represented.  Interpretive  procedures  required  at  this  level,  including  inferen¬ 
tial  procedures  for  establishing  coherence  at  the  text  level  within  our  domain  of 
investigation  are  currently  under  study. 
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4.0  MATRES  111  MOOEL  DATA  BASE  SYSTEM 

In  order  to  demonstrate  the  capabilities  of  the  technology  described  in  the  previous 
sections  for  event  record  structuring  within  a  more  complete  environment,  we  have 
added  to  the  system  a  simple  data  storage  and  retrieval  capability,  emulating,  to  a 
limited  extent*  the  type  of  data  base  and  analysis  system  that  could  best  use  the 
type  of  event  record  structure  that  MATRES  III  generates.  Such  a  system  must  be 
rather  different  from  a  standard  data  base  system,  since  an  event  record  may  have 
repeated  slots  of  the  same  type  and  optional  slots,  along  with  hierarchical  relations 
among  event  records.  In  addition,  event  reports  of  the  type  handled  by  MATRES  Ml 
may  contain  incomplete,  uncertain,  and/or  conflicting  information.  Also,  matching 
queries  aga^st  components  of  an  event  record  is  more  complicated  than  simple  iden¬ 
tity  testing;  rjmeric  or  semantic  similarity  must  be  considered  if  the  intent  of  the 
query  Is  to  be  well  carried  out.  In  this  system,  we  present  a  simple  model  of 
retrieval  of  event  records  according  to  a  set  of  matching  criteria,  each  peculiar  to 
the  type  of  slot  under  consideration. 

4.1  Function 

The  basic  functioning  of  our  model  data  base  system  is  as  follows:  sentences  are 
processed  and  event  records  (ERs)  are  displayed  and  stored  The  retrieval  phase 
searches  the  stored  ERs  against  user  queries.  A  query  consists  of  a  sentence  like 
those  processed  in  the  data  base  generation  phase,  but  allowing  particular  com¬ 
ponents  to  be  replaced  by  slot  names  bounded  by  *<*  and  ‘>\  e.g. 

<Object>  deorbited  on  12  juiy  1979. 

Skylab  was  launched  by  <Launchsys>. 

The  missile  Impacted  before  <Time>  on  <date>. 

The  satum-o  flew  to  <Location>. 

These  ‘'open  sentences'*  are  parsed  into  Query  Records  (QRs),  which  are  then 
matched  against  the  stored  ERs;  those  ERs  that  match  are  displayed  in  their 
entirety.  The  following  sections  discuss  the  structure  of  QRs  and  the  matching  pro¬ 
cess;  the  detailed  structure  of  ERs  is  described  in  [Dwiggins  and  Silva  1981]. 

4.2  Open  Sentences  and  QRs 

4.2.7  Slot  Names  The  only  structural  difference  between  normal  sentences  of  the 
corpus  and  open  sentences  is  the  presence  of  "slot  names"  in  open  sentences.  As 
indicated  above,  a  slot  name  is  word  enclosed  in  angle  brackets,  e.g.  "CLocationV 
or  l,<Date>11.  The  following  list  shows  the  slot  names  currently  recognized  by  the 
system,  along  with  the  syntactic  contexts  in  which  they  may  occur. 

Agent 

represents  th  *  'uman  or  institutional  agent  causing  the  action  of  the  given 
satellite  or  mu**.,*;  it  may  occur  as  the  subject  of  an  active  sentence  whose 
abject  is  a  satellite  or  missile,  or  in  a  "by"-phrase  in  a  passive  sentence  of  that 
type  (e.g.  "<Agent>  launched  ..."  or  "...  was  launched  by  <Agent>“). 

Date 

can  occur  as  the  object  of  a  date-related  preposition. 
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Destination 

can  occur  as  the  object  of  a  directional  preposition. 

Flightsource 

can  occur  in  a  "flight" -type  sentence  as  the  object  of  "from". 

Inclination 

can  occur  in  a  ,,taunch"-type  sentence  as  the  object  of  "on". 

Infosource 

must  be  the  subject  of  a  verb  which  takes  a  sententiai  complement,  e.g. 
"announce  [that]",  "expect  [that]". 

Launchsite 

can  occur  in  a  "!aunch"-type  sentence  as  the  object  of  "from". 

Launchsys 

can  occur  in  a  "!aunch“-rype  sentence  as  the  adject  at  "by",  or  as  the  subject . 
Location 

can  occur  in  most  types  of  sentence,  as  the  object  of  a  directional  or  locative 
preposition  or  as  the  direct  object  of  certain  verbs  such  as  "reenter". 

Object 

can  take  the  place  of  a  noun  phrase  denoting  a  missile  or  satellite  in  the  sub¬ 
ject  or  direct  object  of  a  sentence. 

Revolution 

takes  the  place  of  a  noun  phrase  such  as  "revolution  1  23"  or  "the  45th  revolu¬ 
tion"  in  a  prepositional  phrase. 


Time 

can  occur  as  the  object  of  a  prepositional  phrase  appropriate  to  time. 

Slot  names  have  two  functions  in  querying:  first,  they  may  be  used  to  fill  obligator/ 
syntactic  roles  to  make  a  sentence  parsable,  as  in  the  first  example  sentence  m 
Section  1  (in  this  example,  the  user  presumabiy  wished  to  retrieve  any  ER  concern¬ 
ing  a  deorbiting  event  on  1 2  juty  19 79,  regardless  of  what  deorbited);  second,  they 
allow  the  user  to  specify  the  presence  of  slots  in  ERs  to  be  retrieved,  without  res¬ 
tricting  the  contents  of  those  slots. 

4.2.2  Processing  Open  Sentences  An  open  sentence  presented  as  a  query  will 
create  a  QR  with  the  same  structure  as  an  ER,  except  that  the  slot  filler  correspond¬ 
ing  to  a  slot  name  in  the  sentence  will  have  the  value  "[]",  which  will  be  taken  as 
matching  anything  in  the  corresponding  place  in  the  ER. 

To  extend  the  message  processing  portion  of  the  system  to  process  open  sentences 
and  generate  query  records  required  only  slight  extensions:  the  "morphology"  sec¬ 
tion  was  modified  to  recognize  slot  names  and  create  special  lexical  items  for  them; 
these  are  recognized  by  the  grammar  in  the  appropriate  places,  as  indicated  above; 
finally,  a  few  template  matching  predicates  had  to  be  extended  to  recognize  these 
items, 

4.3  Implementation 

A  generated  event  record  is  stored  essentially  as  a  set  of  slots,  with  mar*^rs  to 
cross-reference  slots  and  event  records.  This  scheme  allows  direct  matching  of 
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corresponding  ER  and  QR  slots.  An  ER  will  match  a  QR  if  and  only  if  all  QR  slots  match 
the  corresponding  ER  slots .  Thus,  the  QR  slots  may  be  a  "subset"  of  the  ER  slots, 
but  not  vice  versa.  This  scheme  was  chosen  as  an  a  priori  reasonable  approximation 
to  what  a  user  might  desire  to  retrieve  via  a  given  query. 
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5.0  FUTURE  DIRECTIONS 


MATRES  III,  as  an  embodiment  of  OSTs  automated  data  base  generation  technology, 
provides  a  clear  demonstration  of  the  validity  and  operational  feasibility  of  this  tech- 
nology.  It  is,  of  course,  a  limited  system,  designed  within  the  constraints  of  a  single 
process  running  on  a  timeshared  minicomputer,  and  written  in  a  programming  language 
which  has  been  designed  more  to  give  easy  expression  to  complex  algorithms  than  to 
fully  exploit  the  power  of  a  given  computer.  Nevertheless,  it  it  fully  functional  on  its 
limited  domain,  and  provides  an  excellent  base  for  further  development  of  the  mes¬ 
sage  processing  and  data  base  technology. 

In  this  section,  we  consider  the  possibilities  for  further  development  in  two  direc¬ 
tions:  first,  to  bring  the  current  technology  to  the  point  where  it  could  operate,  in  a 
production  environment,  as  a  front  end  to  a  message  content  analysis  system,  and 
could  handle  live  message  traffic  at  normal  (and  peak)  traffic  loads.  Second,  we  dis¬ 
cuss  the  development  directions  which  would  allow  the  system  to  operate  on  a  wider 
class  of  input,  and  with  more  power  in  its  automatic  processing. 

At  a  minimum,  a  production  version  of  the  system  should  also  be  able  to  extract 
header  information  that  could  be  useful  in  processing  message  text,  e.g.  date/time, 
source,  references,  content  identifiers.  This  technology  is  welt  understood,  and  is 
utilized  in  OSI's  message  handling  systems;  such  capabilities  could  thus  easily  be 
added  to  the  MATRES  design  whenever  a  production  system  development  is  required. 
A  production  version  of  the  system  must  also  be  able  to  deal  with  such  characteris¬ 
tics  of  live  traffic  as  misspellings,  unexpected  information,  totally  unprocessable 
sentences  (e.g.  comments  on  political  implications),  etc,,  without  losing  its  context. 
Although  much  of  this  could  be  automated  using  state  of  the  art  technology  (e.g., 
spelling  correction  systems),  a  human  partner  will  be  required  as  backup  to  handle 
the  intractable  material  in  the  foreseeable  future,  since  new  problems  will  be 
encountered  as  additional  subject  domains  and  capabilities  are  added  to  the  system. 
Integrating  a  human  partner  in  a  problem-solving  mode  would  require  a  sophisticated 
error  and  problem  diagnostic  capability,  as  well  as  a  complex  interaction  facility 
between  the  user  and  the  system’s  knowledge  base. 

Depending  on  the  requirements  of  the  content  analysis  system,  the  current  approach 
to  event  record  structuring  and  generation  might  require  considerable  modification.  It 
Is  possible  to  envision  two  different  approaches  such  modifications  might  take.  One 
approach  would  be  based  on  a  complete  analysis  of  every  sentence  in  each  mes¬ 
sage,  while  another  would  use  the  templates  to  drive  a  selective  analysis  of  key 
segments  of  the  messages,  searching  for  template-specific  types  of  content.  With 
the  first  approach,  a  traditional  data  base  system  would  place  considerable  require¬ 
ments  on  the  system  to  resolve  pronouns  and  other  referential  constructs,  as  well  as 
Inter-event  relationships,  to  represent  the  semantics  of  a  message  within  the  struc¬ 
tural  constraints  of  such  a  system.  The  second  approach  would  be  aimed  at  covering 
more  material  but  with  less  precision,  leaving  gaps  that  the  user  would  have  to  fill. 

Finally,  in  order  to  handle  realistic  traffic  loads,  the  system  may  well  need  to  be  pro¬ 
grammed  at  a  lower  level,  with  efficiency  as  a  prime  objective.  Indeed,  in  order  to 
handle  a  variety  of  reasonably  complex  event  types  with  acceptable  speed,  a  fast 
processor  with  a  large  primary  memory  could  be  required,  and  considerable  attention 
paid  to  efficient  representation  of  algorithms  and  data. 
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On  the  theoretical  aide,  we  have  taken  a  fairly  conservative  approach  to  date,  us-ng 
mostly  welt  established  techniques  of  parsing  and  semantic  interpretation,  with  a 
trivial  approach  to  morphology  and  lexical  problems.  Considerable  improvements  are 
possible  in  all  the  analytical  components  of  the  system.  Specifically,  within  areas: 

Morphology:  Currently,  we  make  no  attempt  at  a  real  morphology,  i.e.  using  morpho¬ 
logical  analyr*s  to  supplement  dictionary  lookup,  and  thus  reduce  dictionary  size 
and  simplify  the  job  of  extending  the  system’s  vocabulary.  In  addition,  military 
language  is  rife  with  abbreviations  and  terminology  whose  meaning  at  ieast  in 
part,  discernable  from  their  form.  Even  in  the  case  of  an  unknown  word,  a  good 
morphological  analysis  can  often  give  a  clue  to  its  syntactic  role  throtign  affix 
analysis. 

Syntax:  Our  current  algorithm  is  strictly  left-to-right  and  top-down.  In  addition  the 
grammar  is  essentially  coded  as  part  of  the  program.  In  the  effort  to  automati¬ 
cally  handle  mistyped  or  genuinely  new  words  or  phrases,  It  would  be  highly 
desirable  to  be  able  to  analyze  first  those  pieces  of  syntax  that  come  easiest, 
and  use  these  to  aid  in  parsing  the  remainder  of  the  sentence.  !n  addition, 
once  one  or  a  small  set  of  possible  templates  has  been  identified  as  applying  to 
the  sentence,  such  semantic  information  could  be  a  considerable  aid  to  pro¬ 
cessing  difficult  areas,  or  at  ieast  in  identifying  them  for  the  human  partner. 

Template  Filling:  As  mentioned  above,  the  template  selection  and  filling  process  could 
be  integrated  with  syntax  analysis  (and  possibly  even  morphology)  to  their 
mutual  benefit.  In  a  restricted  sublanguage  such  as  we  are  dealing  with,  con¬ 
siderable  sharing  can  be  done  between  syntactic  and  semantic  analysis. 

Text  notations:  We  have  currently  taken  only  a  small  step  toward  the  processing  of 
a  message  of  several  sentences  as  a  whole  unit,  rather  than  a  sequence  of 
Isolated  pieces.  At  least  within  a  message,  resolution  of  pronouns  and  other 
anaphora  could  be  performed  to  some  extent  using  extant  techniques.  Refer¬ 
ences  to  earlier  messages,  explicit  and  implicit,  would  require  cooperation 
between  the  event  record  generation  system  and  the  content  analysis  system 
to  which  it  interfaces. 

At  another  level  of  analysis,  work  should  be  done  to  systematize  the  parts  of  the 
system  that  are  specific  to  the  sublanguage  domain,  so  that  the  modification  of  the 
system  to  add  new  domains,  with  attendant  vocabulary,  syntax,  and  templates,  can 
be  done,  perhaps  with  some  guidance,  by  a  subject  expert.  This  ;*>pe  of  analysis, 
called  "knowledge  acquisition"  is  a  fairly  new  field  of  investigation,  but  is  beginning 
to  receive  considerable  attention. 
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Appendix  A  —  Sample  Sentences  from  MATRES  III  Corpus 
SKYLAB  BROKE  UP  IN  ORBIT. 

SKYLAB  HAS  BROKEN  UP  IN  ORBIT. 

BREAKUP  OF  SKYLAB  OCCURRED  OVER  THE  INDIAN  OCEAN 
NEAR  WESTERN  AUSTRALIA. 

BREAKUP  OF  SKYLAB  TOOK  PLACE  OVER  WESTERN  AUSTRALIA. 

BREAKUP  OF  SKYLAB  WAS  OVER  WESTERN  AUSTRALIA. 

SKYLAB,  THE  ORBITAL  WORKSHOP  WHICH  WAS  LAUNCHED 
FROM  THE  KENNEDY  SPACE  CENTER  ON  14  MAY  1973, 

HAS  BROKEN  UP  IN  ORBIT. 

THE  ORBITAL  WORKSHOP,  SKYLAB,  WHICH  WAS  LAUNCHED 
FROM  THE  KENNEDY  SPACE  CENTER  ON  1  4  MAY  1  973, 

BROKE  UP  AT  0900Z  ON  1  2  JUL  1  979. 

BREAKUP  OF  SKYLAB,  THE  ORBITAL  WORKSHOP  WHICH  WAS 

LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER 

ON  14  MAY  1973,  TOOK  PLACE  AT  0900Z  ON  12  JUL  1979. 

SKYLAB,  THE  81  DEGREE  PROBABLE  THIRD  GENERATION  SATELLITE, 

HAS  BEEN  CONFIRMED  IN  ORBIT. 

THE  APPROXIMATELY  8  SATELLITES  WHICH  WERE  LAUNCHED  FROM  CAIRO  ON  21  MAY  1988 
HAVE  BEEN  CONFIRMED  IN  ORBIT. 

THE  SATELLITE  WAS  CONFIRMED  IN  ORBIT  ON  REVOLUTION  ONE 
AT  0900Z  ON  31  MAY  1978. 

THE  SATELLITE  HAS  BEEN  CONFIRMED  IN  ORBIT  ON  REVOLUTION  ONE 
AT  0900Z  ON  3 1  MAY  19  71 

THE  SKVLAB  ORBITAL  WORKSHOP,  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A 
SATURN-5  LAUNCH  VEHICLE, 

WAS  CONFIRMED  IN  ORBIT  AT  ABOUT  1  700Z  ON  REVOLUTION  ONE. 

SKYLAB,  THE  81  DEGREE  PROBABLE  THIRD  GENERATION  SATELLITE 
LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER  AT  1  900Z  ON  6  MAR  1971 
BY  A  SATURN-5  TYPE  LAUNCHER, 

HAS  3EF.N  CONFIRMED  N  ORBIT  ON  REVOLUTION  ONE. 

SKYLAB,  LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER 
AT  1900Z  TOOAY,  HAS  BEEN  CONFIRMED  IN  ORBIT 
ON  REVOLUTION  ONE. 


SKYUB,  THE  81  DEGREE  PROBABLE  THIRD  GENERATION  SATELLITE, 

LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER  EARLIER  TODAY  AT  1  300Z, 

HAS  BEEN  CONFIRMED  IN  EARTH  ORBIT  ON  REVOLUTION  ONE. 

THE  SOVIET  NEWS  AGENCY  TASS  ANNOUNCED  THAT  SKYLAB  OEORBITED  INTO 
THE  INDIAN  OCEAN  ON  THE  34981  ST  REVOLUTION. 

THE  SOVIET  NEWS  AGENCY  TASS  ANNOUNCED  THAT  THE  DEORBIT  OF  SKYLAB 
PROBABLY  OCCURRED  OVER  CANADA  EARLY  ON  REVOLUTION  34981 

THE  DEORBIT  OF  SKYLAB  WAS  OVER  CANADA  ON  12  JULY  19  79. 

DEOR8IT  TOOK  PLACE  INTO  THE  AUSTRALIAN  OUTBACK. 

DEORBIT  TOOK  PUCE  AT  1900Z  ON  1  2  JULY  1979. 

THE  DEORBIT  OF  SKYUB  OVER  CANADA  TOOK  PLACE  ON  12  JULY  1979. 

THE  DEORBIT  OF  SKYUB  WAS  DURING  THE  INITIAL  PORTION  OF  REVOLUTION 
34981. 

THE  SPACECRAFT  WAS  OEORBITED  DURING  ITS  12STH  REVOLUTION 
AT  1330  HOURS  THIS  DATE. 

THE  DEORBIT  OF  SKYUB  OCCURRED  DURING  THE  EARLY  PORTION  OF 
REVOLUTION  34981. 

THE  DEORBIT  OF  SKYUB  WAS  PROBABLY  OVER  CANADA  ON  12  JULY  1979. 

THE  SKYUB  ORBITAL  WORKSHOP,  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A 
SATURN-6  UUNCH  VEHICLE,  WAS  DEORBITED  INTO  THE  INDIAN  OCEAN 
ON  12  JULY  1979. 

THE  SKYUB  ORBITAL  WORKSHOP,  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A 
SATURN-6  UUNCH  VEHICLE,  SUCCESSFULLY  DEORBITED  INTO  THE  AUSTRALIAN 
OUTBACK  ON  1 2  JULY  1 979. 

THE  SPACECRAFT  DEORBITED  ON  THE  SAME  DAY. 

THE  SATELLITE  WAS  DEORBITED  EARLY  ON  REVOLUTION  125. 

THE  SATELLITE  WAS  DEORBITED  INTO  THE  INDIAN  OCEAN. 

NASA  DEORBITED  THE  SATELLITE  INTO  THE  INDIAN  OCEAN  ON  THE  EARLY 
PORTION  OF  REVOLUTION  1  45. 

THE  SATELLITE  WAS  DEORBITED  INTO  THE  INOIAN  OCEAN  EARLY  ON  REVOLUTION  ONE. 
DEORBIT  OF  SKYUB  OCCURRED  OVER  CANADA  EARLY  ON  REVOLUTION  34981. 


DEORBIT  TOOK  PLACE  EARLY  ON  REVOLUTION  34981. 


NASA  DEORBITED  THE  SATELLITE  INTO  THE  INDIAN  OCEAN  ON  REVOLUTION  123. 

I  HE  SATELLITE  WAS  DEORBITED  BY  NASA  INTO  THE  INDIAN  OCEAN  ON 
REVOLUTION  123, 

THF  SATELLITE  WAS  EXPECTED  TO  DEORBIT  INTO  THE  INDIAN  OCEAN. 

DEORBIT  WAS  EXPECTED  TO  OCCUR  IN  THE  INDIAN  OCEAN. 

DEORBIT  OF  THE  SATELLITE  WAS  EXPECTED  TO  OCCUR  IN  THE  INDIAN  OCEAN. 

DEORBIT  OF  THE  SATELLITE  FAILED  TO  OCCUR  IN  THE  INDIAN  OCEAN. 

THE  SATELLITE  FAILED  TO  DEORBIT  INTO  THE  INDIAN  OCEAN. 

THE  I  CBM  LAUNCHED  FROM  THE  PLACENAME  MISSILE  TEST  RANGE 
AT  APPROXIMATELY  1330  HOURS  ON  13  MAY 
FLEW  TO  THE  NOMINAL  100  KM  IMPACT  AREA  NEAR  THE 
PLACENAME  MISSILE  TEST  CENTER. 

AN  ICBM  WAS  SUCCESSFULLY  FLOWN  TO  THE  PLACENAME  AREA. 

NASA  ANNOUNCED  THAT  THE  IMPACT  OF  SKYLAB  TOOK  PUVCE  IN  AN  AREA 
ABOUT  500  MILES  FROM  PERTH  IN  WESTERN  AUSTRALIA. 

THE  SKYLAB  ORBITAL  WORKSHOP,  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A 
SATURN-5  LAUNCH  VEHICLE,  IMPACTED  IN  WESTERN  AUSTRALIA  JUST 
NORTHEAST  OF  KALGOORLIE  ON  1  2  JULY  1  979. 

THE  VEHICLE  IMPACTED  NEAR  KALGOORLIE  IN  THE  AUSTRALIAN  OUTBACK. 

IMPACT  OF  SKYLAB  OCCURRED  IN  WESTERN  AUSTRALIA  JUST  NORTHEAST  OF 
KALGOORLIE  ON  12  JULY  1979. 

IMPACT  OCCURRED  INTO  THE  NORMAL  RECOVERY  AREA. 

IMPACT  OCCURRED  AT  1  900Z  ON  12  JULY  1979. 

IMPACT  IN  WESTERN  AUSTRALIA  WAS  NEAR  KALGOORLIE. 

THE  IMPACT  OF  SKYLAB  TOOK  PLACE  IN  THE  AUSTRALIAN  OUTBACK. 

IMPACT  OF  SKYLAB  WAS  ABOUT  500  MILES  NORTHEAST  OF  PERTH  IN  WESTERN 
AUSTRALIA. 

IMPACT  WAS  ABOUT  10  NM  OUTSIDE  OF  THE  NORMAL  RECOVERY  AREA. 

THE  MISSILE  IMPACTED  NEAR  THE  NORMAL  RECOVERY  AREA  AT  1900 Z  ON  13 
APR  ^999. 
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IMPACT  PROBABLY  OCCURRED  IN  THE  AUSTRALIAN  OUTBACK  JUST  NORTHEAST 
OF  THE  CENTRAL  PORTION  OF  THE  RECOVERY  AREA. 


IMPACT  OF  SKYLAB  IN  WESTERN  AUSTRALIA  WAS  ON  12  JULY  1979. 

IMPACT  IN  AN  AREA  JUST  NORTHEAST  OF  KALGOORLIE  IN  WESTERN  AUSTRALIA 
WAS  ON  12  JULY  1979. 

IMPACT  WAS  PROBABLY  IN  AN  AREA  ABOUT  500  KMS  FROM  PERTH  IN 
WESTERN  AUSTRALIA  ON  12  JULY  1979. 

IMPACT  PROBABLY  OCCURRED  IN  AN  AREA  ABOUT  100  KMS  "ROM  KALGOORLIE. 

IMPACT  OF  SKYLAB  WAS  PROBABLY  IN  AN  AREA  ABOUT  500  KMS  NORTHEAST 
OF  PERTH. 

IT  IMPACTED  IN  WESTERN  AUSTRALIA  NEAR  KALGOORLIE  AT  1420Z. 

A  PROBABLE  NAVAL  ICBM  WAS  LAUNCHED  FROM  THE  INDIAN  OCEAN 
MISSILE  COMPLEX  TO  THE  INDIAN  OCEAN  EXTENDED  RANGE  IMPACT  AREA 
AT  1900Z,  ON  23  MAY  1971. 

THE  SKYLAB  ORBITAL  WORKSHOP.  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A 
SATURN-6  LAUNCH  VEHICLE,  WAS  LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER 
AT  ABOUT  1  330  HOURS  ON  1  4  MAY  1  973. 

THE  SKYLAB  ORBITAL  WORKSHOP,  A  CONVERTED  S-48  THIRD  STAGE  FROM  A 
SATURN-6  LAUNCH  VEHICLE,  WAS  FIRED  FROM  THE  KENNEDY  SPACE  CENTER 
AT  APPROXIMATELY  1330  HOURS  ON  14  MAY  1973. 

AT  APPROXIMATELY  1330  HOURS  ON  14  MAY  1973,  THE  SKYLAB  ORBITAL 
WORKSHOP  WAS  FIRED  FROM  THE  KENNEDY  SPACE  CENTER  8Y  A  SATURN-S 
TYPE  LAUNCHER. 

AT  1320Z,  A  DEFENSIVE  MISSILE,  POSSIBLY  A  SAM,  WAS  LAUNCHED  FROM 
THE  MISSILE  TEST  CENTER. 

AN  UNIDENTIFIED  MISSILE,  PERHAPS  A  DRONE,  WAS  FIRED  FROM  THE 
TEST  CENTER  AT  1320Z  ON  15  MAY  1971. 

THE  SKYLAB  ORBITAL  WORKSHOP  WAS  LAUNCHED  FROM  THE  KENNEDY  SPACE 
CENTER  AT  APPROXIMATELY  1330  HOURS  ON  14  MAY  1973  BY  A  SATURN-5 
TYPE  LAUNCHER. 

THE  SATELLITE  WAS  LAUNCHED  ON  A  67  DEGREE  ORBITAL  INCLINATION. 

THE  LAUNCH  OF  SKYLAB  TOOK  PLACE  ON  14  MAY  1973. 

THE  SOVIET  NEWS  AGENCY,  TASS.  ANNOUNCED  THAT  THE  LAUNCH 
OF  SKYLAB  TOOK  PLACi.  ON  14  MAY  1973. 
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THE  SOVIET  NEWS  AGENCY,  TASS,  ANNOUNCED  THE  LAUNCHING  OF  A 
MANNED  SATELLITE. 

IT  WAS  LAUNCHED  AT  1  320Z  TODAY  FROM  THE  KENNEDY  SPACE  CENTER. 

SKYLAB  REENTERED  THE  EARTH’S  ATMOSPHERE  OVER  CANADA  ON  1  2  JUL  1  979. 

REENTRY  OCCURRED  IN  THE  KALGOORLIE  REGION  AT  ABOUT  1900  HOURS 
ON  12  jUL  1979. 

REENTRY  OF  SKYLAB  TOOK  PLACE  OVER  CANADA  ON  21  JUL  1979. 

THE  MISSILE  REENTERED  OVER  WESTERN  AUSTRALIA  IN  THE 
GENERAL  VICINITY  OF  KALGOORLIE  ABOUT  1 0  MINUTES  AFTER  LIFT-OFF. 

THE  SPACECRAFT  REENTERED  THE  EARTH’S  ATMOSPHERE  AT  1  330Z  IN  THE 
GENERAL  VICINITY  OF  9999N9999E. 

AN  UNIDENTIFIED  MISSILE,  LAUNCHED  FROM  THE  PLACENAME  TEST  RANGE  AT 
1900  HOURS  TODAY,  REENTERED  IN  THE  NORMAL  RECOVERY  AREA. 

IMPACT  OCCURRED  AT  THE  SAME  TIME. 

IMPACT  OCCURRED  EARLIER  TODAY. 

IMPACT  OCCURRED  AT  ABOUT  1230Z  THIS  DATE. 

IMPACT  OCCURRED  AT  ABOUT  1230Z  ON  20  MAY  1971. 

IMPACT  OCCURRED  AT  ABOUT  1230  HOURS  ON  20  MAY  1971. 

IMPACT  OCCURRED  ABOUT  1 9  MINUTES  LATER 

AT  A  POINT  NEAR  1530N2S20E  SOME  50  MILES  DOWNRANGE. 

IMPACT  TOOK  PLACE  IN  THE  PLACENAME 
ABOUT  1  0  MINUTES  LATER. 

IMPACT  WAS  ABOUT  50  NAUTICAL  MILES  OUTSIDE  OF  THE 
NORTHWESTERN  EDGE  OF  THE  NORTHERN  BROAD  OCEAN  AREA 
CIRCULAR  CLOSURE. 

IMPACT  OCCURRED  IN  CANADA  IN  THE  GENERAL  VICINITY  OF 
OOOONOOOOW  ABOUT  1  0  MINUTES  AFTER  LIFT-OFF. 

THE  MISSILE  IMPACTED  ON  PLACENAME  ABOUT  1 0  MINUTES  AFTER  LAUNCH. 

THE  MISSILE  REENTERED  IN  THE  NORMAL  RECOVERY  AREA 
ABOUT  1  0  MINUTES  AFTER  LAUNCH  AT  1 33CZ. 
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Appendix  B  —  Sample  Event  Record  Outputs  from  MATRES  ill 


skyiab  broke  up  in  orbit  . 

Event:  breakup 
Object:  satellite 
Equipment^  skyiab 
Quantity* 

Locations  in  orbit 

Times 

Dates 


skyiab  has  broken  up  in  orbit  . 

Event:  breakup 
Object:  satellite 
Equipment*  skyiab 
Quantity* 

Location*  in  orbit 
Time* 

Date* 


breakup  of  skyiab  occurred  over  the  Indian  ocean  near  western  australis  . 

Event:  breakup 
Object:  satellite 
Equipment*  skyiab 
Quantity* 

Location*  over  the  indian  ocean  near  western  Australia 
Time* 

Date* 


breakup  of  skyiab  took  place  over  western  australia  . 

Event;  breakup 
Object:  satellite 
Equipment*  skyiab 
Quantity* 

Location*  over  western  australia 
Time* 

Date* 


breakup  of  skyiab  was  over  wmstern  australia  . 


Event:  breakup 
Object:  satellite 
Equipment^  skylab 
Quantity* 

Location*  over  western  australla 
Time* 

Date* 


skylab  ,  the  orbital  workshop  which  was  launched  from  the  kennedy  space 
center  on  1 4  may  1973,  has  broken  up  in  orbit  . 

Event:  breakup 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Add_  Attributes*  the  orbital  workshop 
Rel^  event* 

Event:  launch 
Agent*  agentunk 
Object:  satellite 

Equipment*  orbital  workshop 
Quantity* 

Launchsite*  from  the  kennedy  space  center 
Destination* 

Time* 

Date*  on  1 4  may  1973 
Location*  in  orbit 
Time* 

Date* 


the  orbital  workshop  ,  skylab  ,  which  was  launched  from  the  kennedy  space 
center  on  14  may  1973  ,  broke  up  at  0900  on  12  jul  1979  . 

Event:  breakup 
Object:  satellite 

Equipment*  orbital  workshop 
Quantity* 

Add..  Attributes*  skylab 
Rei_  event* 

Event:  launch 
Agent*  agentunk 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Launchsite*  from  the  kennedy  space  center 
Destination* 

Time* 

Date*  on  1 4  may  1 973 
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Location® 

Times  at  0900  hours 
Dates  on  1 2  jul  1 979 


breakup  of  skyiab  ,  the  orbitai  workshop  which  was  launched  from  the  kennedy 
space  center  on  14  may  1973  ,  took  place  at  0900  on  12  jul  1979  . 

Event:  breakup 
Object:  satellite 
Equipments  skyiab 
Quantity3 

Add_ Attributes3  the  orbitai  workshop 
Rel__  event3 
Event:  launch 
Agent3  agentunk 
Object:  satellite 

Equipment3  orbital  workshop 
Quantity3 

Launchsite3  from  the  kennedy  space  center 

Destination3 

Time3 

Date3  on  1 4  may  1 973 
Location3 

Time3  at  0900  hours 
Date3  on  12  jul  1979 

skyiab  ,  the  81  degree  probable  third  generation  sateilite  ,  has  been 
confirmed  in  orbit  . 

Event:  confirm 
Object:  satellite 
Equipment3  skyiab 
Quantity3 

Add__ Attributes3  the  81  degree  probable  third  generation  satellite 
Location3  in  orbit 
Revolution3 
Time3 
Date3 

the  approximately  8  satellites  which  were  launched  from  cairo  on  21  may  1988 
have  been  confirmed  in  orbit  . 

Event:  confirm 
Object:  sateilite 

Equipment3  satellites 
Quantity3  approximately  8 
Rel_  event* 
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Event:  launch 
Agents  agentunk 
Object:  satellite 

Equipments  satellites 
Quantity*  approximately  8 
Launchsites  from  cairo 
Destinations 
Times 

Date*  on  21  may  1  988 
Location*  in  orbit 
Revolution* 

Time* 

Date* 


the  satellite  was  confirmed  in  orbit  on  revolution  one  at  0900  on  31  may  1978. 

Event:  confirm 
Object:  satellite 
Equipment*  satellite 
Quantity3 
Locations  in  orbit 
Revolutions  on  revolution  one 
Times  at  0900  hours 
Da£e*  on  31  may  1978 


the  satellite  has  been  confirmed  in  orbit  on  revolution  one  at  0900  on  31  may 
1971  . 

Event:  confirm 
Object:  satellite 
Equipment*  satellite 
Quantity* 

Locations  in  orbit 
Revolution*  on  revolution  one 
Time*  at  0900  hours 
Date*  on  31  may  1971 


the  skylab  orbital  workshop  ,  a  converted  s-4b  third  stage  from  a  saturn-5 
launch  vehicle  ,  was  confirmed  in  orbit  at  about  1  700  on  revolution  one  . 

Event:  confirm 
Object:  satellite 

Equipment*  skylab  orbital  workshop 
Quantity* 

Add— Attributes*  a  converted  s-4b  third  stage  from  a  saturn-5  launch 
vehicle 

Locations  in  orbit 
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Revolutions  on  revolution  one 
Time*  at  about  1700  hours 
Date* 


skylab  ,  the  81  degree  probable  third  generation  satellite  launched  from  the 
kennedy  space  center  at  1900  on  6  mar  1971  by  a  satum-5  type  launcher  ,  has 
been  confirmed  in  orbit  on  revolution  one  . 

Event:  confirm 
Object:  satellite 
Equipments  skylab 
Quantity® 

Add__ Attributes®  the  81  degree  probable  third  generation  satellite 
ReL_  events 
Event:  launch 
Agent=  agentunk 
Object:  satellite 
Equipment®  skylab 
Quantity® 

Launchsys®  by  a  satum-6  type  launcher 
Launchsite®  from  the  kennedy  space  center 
Destinations 
Tlme=  at  1900  hours 
Date®  on  Q  mar  1971 
Locations  in  orbit 
Revolution®  on  revolution  one 
Time® 

Date® 


skylab  ,  launched  from  the  kennedy  space  center  at  1900  today  ,  has  been 
confirmed  in  orbit  on  revolution  one  . 

Event:  confirm 
Object:  satellite 
Equipment®  skylab 
Quantity® 

Relevant® 

Event:  launch 
Agent®  agentunk 
Object:  satellite 
Equipments  skylab 
Quantity® 

Launchsite®  from  the  kennedy  space  center 
Destination® 

Time®  at  1 900  hours 
Date®  today 
Location*  in  orbit 
Revolution*  on  revolution  one 
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Tlme= 

Dates 


skylab  ,  the  81  degree  probable  third  generation  satellite  ,  launched  rrom 
the  kennedy  space  center  earlier  today  at  1 300  ,  has  been  confirmed  in  earth 
orbit  on  revolution  one  . 

Event:  confirm 
Object:  satellite 
Equipment®  skylab 
Quantity* 

Add_  Attributes*  the  81  degree  probable  third  generation  satellite 
Rel_  event* 

Event:  launch 
Agent*  agentunk 
Object:  satellite 

Equipment*  81  degree  probable  third  generation  satellite 
Quantity* 

Launchsite*  from  the  kennedy  space  center 
Destination* 

Time*  earlier  at  1  300  hours 
Date*  today 
Location*  in  earth  orbit 
Revolution*  on  revolution  one 
Time* 

Date* 


the  soviet  news  agency  tass  announced  that  skylab  deorblted  into  the  Indian 
ocean  on  the  34981  revolution  . 

Infosource*  the  soviet  news  agency  tass  announce 
Event:  deorbit 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Location*  into  the  indian  ocean 
Revolution*  on  the  34981  revolution 
Time* 

Date* 


the  soviet  news  agency  tass  announced  that  the  deorbit  of  skylab  probably 
occurred  over  Canada  early  on  revolution  34981  . 

Infosource*  the  soviet  news  agency  tass  announce 
Status*  probably 
Event:  deorbit 
Object;  satellite 
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Equipment*  skylab 
Quantity* 

Location*  over  Canada 

Revolution*  early  on  revolution  34961 

Time* 

Date* 


the  deorbit  of  skylab  wa a  over  Canada  on  1  2  july  1979. 

Event:  deorbit 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Location*  over  Canada 
Revolution* 

Time* 

Date*  on  12  july  1979 


deorbit  took  place  into  the  austraiian  outback  , 
Event:  deorbit 

Location*  into  the  auatralian  outback 
Revolution* 

Time* 

Date* 


deorbit  took  place  at  1900  on  12  july  1979  . 

Event:  deorbit 
Location* 

Revolution* 

Time*  at  1 900  hours 
Date*  on  12  july  1979 


the  deorbit  of  skylab  over  Canada  took  place  on  12  july  1979  . 

Event:  deorbit 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Location*  over  Canada 
Revolution* 

Time* 

Date*  on  1  2  july  1 979 
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the  deorbit  of  skylab  was  during  the  initial  portion  of  revolution  34981  . 

Event:  deorbit 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Location* 

Revolution*  during  the  initial  portion  of  revolution  34981 


the  spacecraft  was  deorbited  during  its  125  revolution  at  1330  Hours  this 
date 

Event:  deorbit 
Object:  satellite 

Equipment*  spacecraft 
Quantity* 

Agent*  agentunk 
Location* 

Revolution*  during  its  125  revolution 
Time*  at  1 330  hours 
Date*  this  date 


the  deorbit  of  skylab  occurred  during  the  early  portion  of  revolution  34981  . 

Event:  deorbit 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Location* 

Revolution*  during  the  early  portion  of  revolution  34981 


the  deorbit  of  skylab  was  probably  over  Canada  on  12  july  1979 

Event:  deorbit 
Object:  satellite 
Equipment*  skylab 
Quantity* 

Location*  probably  over  Canada 
Revolution* 

Time* 

Date*  on  1  2  july  1979 


the  skylab  orbital  workshop  ,  a  converted  s-4b  third  stage  from  a  saturn-5 
launch  vehicle  ,  was  deorbited  into  the  indian  ocean  on  1  2  july  1979 


Event:  deorbit 


? 


Object:  satellite 

Equipment*  skylab  orbital  workshop 
Quantity* 

Add_ Attributes*  a  converted  s-4b  third  stage  from  a  aaturn-5  launch  vehicle 
Agent*  agentunk 
Location*  into  the  Indian  ocean 
Revolution* 

Time* 

Date*  on  12  july  1979 


the  skylab  orbital  workshop  ,  a  converted  s-4b  third  stage  from  a  aaturn-5 
launch  vehicle  ,  successfully  deorbited  into  the  australian  outback  on  1  2  july  1979 

Status*  successfully 
Event-  deorbit 
Object,  satellite 

Equipment*  skylab  orbital  workshop 
Quantity* 

Add_  Attributes*  a  converted  s-4b  third  stage  from  a  saturn-6  launch  vehicle 
Location*  into  the  australian  outback 
Revolution* 

Time* 

Date*  on  1  2  july  1  979 


the  spacecraft  deorbited  on  the  same  day  . 

Event,  deorbit 
Object:  satellite 

Equipment*  spacecraft 
Quantity* 

Location* 

Revolution* 

Time* 

Date*  on  the  same  day 


iha  satellite  was  deorbited  early  on  revolution  125  . 

Event:  deorbit 
Object:  satellite 
Equipment*  satellite 
Quantity* 

Agent*  agentunk 
Location* 

Revolution*  early  on  revolution  1 25 


the  satellite  was  deorbited  into  the  Indian  ocean  . 


Event  deorbit 
Object  satellite 

Equipment*  satellite 
Quantity* 

Agent*  agentunk 

Location*  into  the  indlan  ocean 

Revolution* 

Time* 

Oate* 


aeorbited  the  satellite  into  the  indian  ocean  on  the  early  portion  of 

v'.r:!it’on  i  45 

yen!  deoroit 
Object:  satellite 

Equipment*  satellite 
uantity* 
agt*nt*  nasa 

a;, on*  into  the  indian  ocean 
Revolution*  on  the  early  portion  of  revolution  145 
f  tine* 

Oate* 


deorb«ted  into  the  indlan  ocean  early  on  revolution  one  . 

Event,  leorb't 

ucr  sa*  "Jlta 
cTuipn>e ■:;*  satellite 
Juan  :.v  /* 

A^ent*  agentunk 
.ocation*  <nto  the  indlan  ocean 
Revolution*  early  on  revolution  one 
r  ime* 

Date* 


■jeortvt  of  skyiao  occurred  over  Canada  early  on  revolution  3498" 

Event  deorbit 
Object  satellite 
Equipment*  skylab 
Quantity* 

vocation*  over  Canada 

Revolution*  early  on  revolution  34981 

Tmn* 

Oate* 
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deorbit  took  place  early  on  revolution  34981  . 


Event:  deorbit 
Locations 

Revolutions  early  on  revolution  34981 


nasa  deorbited  t be  satellite  into  the  Indian  ocean  on  revolution  123  . 

Event:  deoroit 
Object:  satellite 

Equipments  satellite 
Quantity3 
Agent55  nasa 

Locations  into  the  Indian  ocean 
Revolutions  on  revolution  1  23 
Time3 
Date3 


the  satellite  was  deorbited  by  nasa  into  the  indlan  ocean  on  revolution  1  23  . 

Event:  deorbit 
Object:  satellite 

Equipments  sateJJite 
Quantity3 
Agent3  nasa 


the  satellite  was  expected  to  deorbit  into  the  indlan  ocean  . 

Status3  expect 
Event:  deorbit 
Object:  satellite 

Equipment3  satellite 
Quantity3 

Location3  into  the  indlan  ocean 

Revolution3 

Tlme= 

Date3 


deorbit  was  expected  to  occur  in  the  indlan  ocean  . 

Status3  expect 
Event:  deorbit 

Location3  in  the  indlan  ocean 

Revolution3 

Time® 

Date* 
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deorOit  of  the  satellite  was  expected  to  occur  in  the  indian  ocean  . 


3tatus=  expect 
Event:  deorbtt 
Object:  satellite 

Equipments  satellite 
Quantity= 

Location=  in  the  indian  ocean 
Revolution^ 

Time= 

Dates 


deorbit  of  the  satellite  failed  to  occur  in  the  indian  ocean  . 

Statuss  fail 
Event:  deorbit 
Object,  satellite 

Equipments  satellite 
Uuantity= 

Locations  jn  the  indian  ocean 

Revolutions 

Time  = 

Dates 


the  satellite  failed  to  deorbit  into  the  indian  ocean  . 

Statuss  rap 
Event.  deorOit 
Object:  satellite 

Equipments  satellite 
Quantitys 

Locations  into  the  indian  ocean 

Revolutions 

Times 

Dates 


0*1  2 


,t>  c*> 


MISSION 

of 

Rome  Air  Development  Center 

RAVC  plans  and  executes  res Mich,  development,  test  and 
defected  acquisition  programs  in  support  of.  Command ,  Control 
Communications  and  Intelligence  |C3I)  activities.  Technical 
and  engineering  support  within  areas  of  technical  competence 
is  provided  to  ESt>  Program  Offices  ( PCs )  and  other  ESV 
elements.  The  principal  technical  mission  areas  are 
communications,  electromagnetic  guidance  and  control,  sur¬ 
veillance  of  ground  and  aerospace  objects,  intelligence  data 
collection  and  handling,  information  system  technology, 
ionospheric  propagation,  solid  state  sciences,  microwave 
physics  and  electronic  reliability,  maintainability  and 
compatibility . 
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